SEQ ID NO:

Predicted beginning nucleotide location corresponding to first amino acid residue of amino acid sequence

Predicted end nucleotide location corresponding to first amino acid residue of amino acid sequence

Amino acid segment containing signal peptide
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid, F=Phenylalanine, G=Glycine,
H=Histidine, I=Isoleucine, K=Lysine,
L=Leucine, M=Methionine, N=Asparagine,
P=Proline, Q=Glutamine, R=Arginine,
S=Serine, T=Threonine, V=Valine,
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop
Codon, /=possible nucleotide deletion,
\=possible nucleotide insertion)
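
The legend above can be read as a lookup table. The following is a minimal Python sketch, not part of the original listing, that encodes the legend and flags characters in a segment that fall outside it (digits or punctuation introduced during text extraction, for example). The names LEGEND, unexpected_symbols and expand are hypothetical and chosen only for illustration; skipping whitespace assumes that line breaks in the table are layout only.

```python
# One-letter symbols exactly as given in the table legend.
LEGEND = {
    "A": "Alanine", "C": "Cysteine", "D": "Aspartic Acid", "E": "Glutamic Acid",
    "F": "Phenylalanine", "G": "Glycine", "H": "Histidine", "I": "Isoleucine",
    "K": "Lysine", "L": "Leucine", "M": "Methionine", "N": "Asparagine",
    "P": "Proline", "Q": "Glutamine", "R": "Arginine", "S": "Serine",
    "T": "Threonine", "V": "Valine", "W": "Tryptophan", "Y": "Tyrosine",
    "X": "Unknown", "*": "Stop Codon",
    "/": "possible nucleotide deletion",
    "\\": "possible nucleotide insertion",
}


def unexpected_symbols(segment):
    """Return (position, character) pairs for characters not in the legend.

    Whitespace is skipped (assumption: it is layout, not sequence data).
    """
    return [
        (i, ch)
        for i, ch in enumerate(segment)
        if not ch.isspace() and ch not in LEGEND
    ]


def expand(segment):
    """Spell out every legend symbol in a segment, ignoring anything else."""
    return [LEGEND[ch] for ch in segment if ch in LEGEND]


if __name__ == "__main__":
    print(unexpected_symbols("VIIFSKSRV"))  # clean fragment
    print(unexpected_symbols("KTMKCD2W"))   # fragment containing a stray digit
    print(expand("MK*"))
```

A check of this kind only reports which positions are suspect; it cannot say which residue originally stood there.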








Q 


6398


353 


1306 


HKQMGPLINRCKKILLPTTVPPATMRIWLLGGLLPFLLLLSGLQ 
RPTEGSEVAIKIDFDFAPGSFDDQYQGCSKQVMEKLTQGDYFTK 
DIEAQKNYFRMWQKAHLAWLNQGKVLPQNMTTTHAVAILFYTLN 
SNVHSDFTRAMASVARTPQQYERSFHFKYLHYYLTSAIQLLRKD 
SIMENGTLCYEVHYRTKDVHFNAYTGATIRFGQFLSTSLLKSEA 
QEFGNQTLFTIFTCLGAPVQYFSLKKEVLIPPYELFKVINMSYH 
PRGDWLQLRSTGNLSTYNCQLLKASSKKCIPDPIAIASLSFLTS 
VIIFSKSRV 


6399 


75 


1245 


PNLETYFGRRCEKDSMNFTPTHTPVCRKRTWSKRGVAVSGPTK 
RRGMADSLESTPLPSPEDRLAKLHPSKELLEYYQKKMAECEAEN 
EDLLKKLELYKEACEGQHKLECDLQQREEEIAELQKALSDMQVC 
LFQEREHVLRLYSENDRLRIRELEDKKKIQNLLALVGTDAGEVT 
YFCKEPPHKVTILQKTIQAVGECEQSESSAFKADPKISKRRPSR 
ERKESSEHYQRDIQTLILQVEALQAQLGEQTKLSREQIEGLIED 
RRIHLEEIQVQHQRNQNKIKELTKNLIIHTQELLYESTKDFLQLR 
SENQNKEKS' i L 1 1 i iTR KKKFDKIGKVLPVMHE 
SHHAQSEYIKVMSLCRNEWYFSGRVEGIPKNLQFVM 


6400 


2520 


1053 


KTMKCD2WYEVQSAILRHNCGYAMKTGKFM-1NLMERKDFETWL 
DNISVTFLSLTDLQKNETLDHLISLSGAVQLRHLSNNLETLLKR 
JFLKLLPLELSFYLLKWLDP0TLLTCCLV3KQWNICVTSACTEVW 

otacicnlgwqiddsvqdalhwkkvylkailrmkqledheafets 
slighsarvyalyykdgllctgsddlsaklwdvstgqcvyg:qt 
htcaavkfdeqklvtgsfdntvacwkwssgartqhfrghtgav? 
svdyndeldilvsgsadftvkvwalsagtclntltghtewvtkv 
vlqkckvksllhspgdyillsadkyeikiwpigreinckclktl 
svsedrsiclqprlhfdgkyivcssalglyqwdfasydilrvik 

TPEIANLALLGFGDIFALLFDNRYLYIMDLRTESLISRWPLPEY 
RKSKRGSSFLAGEASWLNGLDGHNDTGLVFATSMPDHSIHLVLW
KEHG 


6401 


109 


766 


PGAAWSRPDLRGCCTGPQPALRMLVLPSPCPQPLAFSSVETMEG 
PPRRTCRSPEPGPSSSIGSPQASSPPRPNHYLLIDTQGVPYTVL 
VDEESQREPGASGAPGQKKCYSCPVCSRVFEYMSYLQRHSITHS 
EVKPFECDICGKAFKRASHLARHHSIHLACGGRPHGCPLCPRRF 
RDAGELAQHSRVHSCERPFQCPHCPRRFMEQNTLQKHTRWKHP 


6402 


1196 


279 


TTSQCGGIRQSSAIPVASMEFAAICLRNALLLLPEEQQDPKQEN
GAKNSNQLGGNTESSESSETCSSKSHDGDKFIPAPPSSPLRKQE 
LENLKCSILACSAYVALALGDNLMALNHADKLLQQPKLSGSLKF 
LGHLYAAEAL1SLDRISDAITHLNPENVTDVSLGISSNEQDQGS 
DKGENEAMESSGKRAPQCYPSSVNSARTVMLFNLGSAYCLRSEY 
DKARKCLHQAASMIHPKEVPPEAILLAVYLELQNGNTQlALQ.il 
KRNQLLPAVKTHSEVRKKPVFQPVHPIQP1QMPAFTTVQRK 


6403 


2 


1690 


RG I HTS VLQGNLQNQM YSHNWIMNLNNLNLTQVQQRNLI TNLQ 
RSVDDTSQAIQRIKNDFQNLQQVFLQAKKDTDWLKEKVQSLQTL 
AANNSALAKANNDTLEDMNSQLNSFTGQMENITTISQANEQNLK 
DLQDLHKDAENRTAIKFNQLEERFQLFETDIVNIISNISYTAHH 
LRTLTSNLNEVRTTCTDTLTKHTDDLTSLKNTLANIRLDSVSLR 
MQQDLMRSRLDTEVANLSVIMEEMKLVDSKHGQLIKNFTILQ3P 
PGPRGPRGDRGSQGPPGPTGNKGQKGEKGEPGPPGPAGERGPIG 
PAGPPGERGGKGSKGSQGPKGSRGSPGKPGPQGPSGDPGPPGPE 
GEEGLPGPQGPPGFQGLQGTVGEPGVPGPRGLPGLPGVPGMPGP 
KGPPGPPGPSGAWPLALQNEPTPAPEDNSCPPHWKNFTDKCYY 
FE 'EI klFEDAKLFCEDKSSHLVFINTREEQQWIKXQMVGRESH 
HIGLTD3ERENEWKWLDGTSPDYKNWKAGQPDNWGHGHGPGEDC 
AGLIYAGQWNDFQCEDVNNFICEKDRETVLSSAL


6404 1 


1012 


222 


AAALAMAAPAPGLISVFSSSQELGAALAQLVAQRAACCLAGARA






WO 01/53312 



PCT/US00/34263






RFALGLSGGSLVSM1ARELPAAVAPAGPASLARWTLGFCDERLV 
PFDHAESTYGLYRTHLLSRLPIPESQVITINPELPVEEAAEDYA 
KKLRQAFQGDSIPVFDLLILGVGPDGHTCSLFPDHPLLQEREKI 
VAPISDSPKPPPQRVTLTLPVLNAARTVIFVATGEGKAAVLKRI 
LEDQEENPLPAALVQPHTGKLCWFLDEAAARLLTVPFEKHSPL 



AALPRPTPRAPLGREGTGSDSEMAASMFYGRLVAVATLRNHRPR 
TAQRAAAQVLGSSGLFNNHGLQVQQQQQRNLSLHEYMSMELLQE 
AGVSVPKGYVAKSPDEAYAIAKKLGSKDWIKAC.VLAGGRGKGT 
FESGLKGGVKIVFSPEEAKAVSSQMIGKKLFTKQTGEKGRICNQ 
VLVCERKYPRREYYFAITMERSFQGPVLIGSSHGGVNIEDVAAE 
TPEAIIKEPIDIEEGIKKEQALQLAQKMGFPPNIVESAAENMVK 
LYSLFLKYDATMIEINPMVEDSDGAVLCMDAKINFDSNSAYRQK 
K1FDLQDWTQEDERDKDAAKANLNYIGLDGNIGCLVMGAGLAMA 
TMDIIKLHGGTPANFLDVGGGATVH2VTEAFKLITSDKKVLAIL 
VNIFGGIMRCDVIAQGIVMAVKDLEIKIPWVRLQGTRVDDAKA
LIADSGLKILACDDLDEAARMWKLSEIYTLAKQAEVDVKFQLP 



HPRQMxGEDTPEAPPYSSGRYDSIKTEVSGCPEDLTVGRAPTAD 
DDDDDHDDHEDNDKMNDSEGMDPERLKAFNMFVRLFVDENLDRM 
VPISKQPKEKIQAIIESCSRQFPEFQERARKRIRTYLKS CRRMK 
KK"GMEMTRPTPPHLTSAMAENILAA=iCESETRKAAKRMRLEIYQ 
SSQDEPIAT,DKQHSRDSAMTHSTYSLPASSYSQDEVYANGGLN 
YSYRGYGALSSNLQPPASLQTGMHSNGESGEARALASRPAPSWV 
CRAALGSGMGRGKQRPVM3RGCLTA 



VGLCLAVSQTVLAQLDALLVFPGQVAQLS CTLSPQHVTI RDYGV~ 
SWYQQRAGSAPRYLLYYRSEEDHHRPADIPDRFSAAKDEAHNAC 
VLTISPVQPEDDADYYCSVGYGFSP



RGCITSSQAWRLFGGVTRGFNMRIEKCYFCSGPIYPGHGMMFVR 
NDCKVFRFCKSKCHKNFKKKRNPRKVRWTKAFRKAAGKELTVDN 
S FEFEKRRNEP IKYQRELWWKTIDAI1KRVEE IKQKRQAKFIMNR 
LKKNKELQKVQDIKEVKQWIHLIRAPLAC-KGKQLEEKMVQQLQE 
DVDMEDAP 



NTALANLLRCFTCDRLCGGCTAPAPPAIIQGIVLQPVMPSCDPGP 
GPACLPTKTFRSYLFRCHRTYSCVHCRAHLAKHDELISKSFQGS
HGRAYLFNSV 



RGGTAGCVACLGCWGQSSSPKAAFPAGSACLPADSCPCLLFQAC 
AISGLFNCITIHPLNIAAGVWMIMNAFILLLCEAPFCCQFIEFA 
NTVAEKVDRLRSWQKAVFYCGMAWPIVISLTLTTLLGNAIAFA 
TGVLYGLSALGKKGDAI5YARIQQQRQQADEEKLAETLEGEL 



RLSIMASSLNEDPEGSRITYVKGDLFACPKTDSLAHCISEDCRM 
GAGIAVLFKKKFGGVQELLNQQKKSGEVAVLKRDGRYIYYLITK 
KRASHKPTYENLQKSLEAMKSHCLKIIGVrDLSMPRIGCGLDRLQ 
WENVSAMIEEVFEATDIKITVYTL 
RPVTSFSPLPGSCGGRLGTRTMLGRSLREVSAALKQGQITPTEL 
CQKCLSLIKKTKFLNAYITVSEEVALKQAEESEKRYKNGQSLGD 
LDGIPIAVKDNFSTSGIETTCASNMLKGYIPPYNATWQKLLDQ 
GALLMGKTNLDEFAMGSGSTDGVFGPVXNPWSYSKQYREKRKQN 
PHSENEDSDWLITGGSSGGSAAAVSAFTCYAALGSDTGGSTRNP 
AAHCGLVGFKPSYGLVSRHGLIPLVNSMDVPGILTRCVDDAAIV
LGALAGPDPRDSTTVHEPINKPFMLPSLADVSKLCIGIPKEYLV 
PELSSEVQSLWSKAADLFESEGAKVIEVSLPHTSYSIVCYHVLC 
TSEVASNMARFDGLQYGHRCDIDVSTEAMYAATRREGFNDWRG 
RILSGNFFLLKENYENYFVKAQKVRRLIANDFVNAFNSGVDVLL 
TPTTLSEAVPYLEFIKEDNRTRSAQDDIFTQAVNMAGLPAVSIP 
VALSNQGLPIGLQFIGRAFCDQQLLTVAKWFEKQVQFPVIQLQE 
IMDDCSAVLENEKLASVSLKQ


6413




885 


HEPRCAGMAASLWMGDLEPYMDENFISRAFATMGETVMSVKIIR 
NRLTGIPAGYCFVEFADLATAEKCLHKINGKPLPGATPAKRFKL 
NYATYGKQPDNSPEYSLFVGDLTPDVDDGMLYEFFVKVYPSCRG 
GKWLDQTGVSKGYGFVKFTDELEQKRALTECQGAVGLGSKPVR 
LSVAIPKASRVKPVEYSQMYSYSYNQYYQQYQNYYAQWGYDQNT 
GSYSYSYPQYGYTQSTMQTYEEVGDDALEDPMPQLDVTEANKEF 


6414 




533 


RGGRAALLPWRRFPCCRPRPQPARPSSRATPGPRSPGMATSIGV 
SFSV3DGVPEAEKNAGEPENTYILRPVFQQRFRPSWKDCIHAV 
LKEELANAEYSPEEMPQLTKHLSENIKDKLKEMGFDRYKMWQV 
VIGE2RGEGVFMASRCFWDADTDNYTHDVFMNDSLFCWAAFGC 
FYY 


6415


2 


1160


FVRQWQSSHRRACGLGCEARAGGGEEPRGRASSVAGWVGAFRAP
FIEAAVAGLGAGSGKRRRGWKMPVHSRGDKKETNHHDEMEVDYA 
ENEGSSSEDEDTESSSVSEDGDSSEMDDEDCERRRMECLDEMSN 
LEKQFTDLKDQLYKERLSQVDAKLQEVIAGKAPEYLEPLATLQE 
NMQIRTKVAGIYRELCLESVKNKYECEIQASRQHCESEKLLLYD 
TVQSELEEKIRRLEEDRHSIDITSELWNDELQSRKKRKDPFWPD 
KKKPGWSGPYIVYMLQDLDILEDWTTIRKAMATLGPHRVKTEP 
PVKL2KHLHSARSEEGRLYYDGEWYIRGQT1CIDKKDECPTSAV 
ITTIXHDEVWFKRPDGSKSKLYISQLQKGKYSIKHS 


6416 




1519 


ElAJADLtiiPACAPVLLSRAlafe 1SVT [ [ r I TQEILSHL 
GLASKTAAWGTT.GTI.RTFLNFSVDKnAQRLLRAITGQGVDRSAI 
VDVLTNRSREQRQLISRNFQERTQQDLMKSLQAALSGNLERIVM 
ALLQPTAQFDAQELRTALKASDSAVDVAIEILATRTPPQLQECL 
AVYKHNFQVEAVDGITSETSGILQDLLLALAKGGRDSYSGIIDY 
NLAEQDVQALQRAEGPSREETWVPVFTQRNPEHLIRVFDQYQRS 
TGQELEEAVQNRFHGDAQVALLGLASVIKNTPLYFADKLHQALQ 
ETEPNYQVLIRILISRCETDLLSIRAEFRKKFGKSLYSSLQDAV 
KGDCQSALLALCRAEDM 


6417 


« 


645 


RGESRVLWSELEGEAGGAGGWASSLNARMDNRFATAFVIACVLS 
LISTIYMAASIGTDFWYEYRSPVQENSSDLNKSIWDEFISDEAD 
EKTYNDALFRYNGTVGLWRRCITIPKNMHWYSPPERTESFDWT 
KCVSFTLTEQFMEKFVDPGNHKSGIDLLRTYLWRCQFLLPFVSL 
GLMCFGALIGLCACICRSLYPTIATGILHLLAGLCTLGSVSCYV 
AG J. _ j JLLHQKLE1jPDNVSGEFGWSFCIjACVSAPLQFMASALFIWA 
AHTNRKEYTLMKAYRVA 


6418 


2 


662 


trtrprrppglgaavgkagarststpagaspaaayqadppppah 
tpappppppcggiachgepakfygydnlqrqpifttqqeaelvq 
ypdcksssgnigedpdhlnqssspsqmfpwmrpqaapgrrrgrq 
tyerfqtlelekeflfnpyltrkrrievshalalterqvkiwfq 

I r MKK <~ N 'KFPVSRQEVKDGETKKEAQELEEDRAEGLTN 


6419 


1 


973 


PGRPRVRNFDLWSKSILQEFFCTRSIQIPANRSKTAMSKCPIFP 
MAHSISTSGPLDKEDTGRQKLISTGSLPATLQGATDSLGLEWHL 
PS PDPVTVP YLS PLWWKELESLLENEGDHAI TVADFVDHHP IV 
FWNLWYFRRLDLPSNLPGLILSSEHCNKYSKIPRHCMSEDSKY 
VLIQMLWDNMKLHQDPGQPLYILWNAHTQKYPMVHLLQKSDNSF 
NQELLKSMVKSIKMNDVYGPMSQILETLNKCPHFKRQRSLYREI
LFLSLVALGRENIDIDAFDKEYKMAYDRLTPSQVKSTHNCDRPP 
STGVHECRKTFGEPYL 


6420 


207 




RKT-IIDKMQTCGVGQDSVPYMICLIHILEEMFGVEQLEDYLNFAN 
YLLWVFTPLILLILPYFTIFLLYLTIIFLHIYKRKHVLKEAYSH 
NLWDGARKTVATLWDGHAAVWHGYEVHGMEKIPEDGPALIIFYH
GAIPIDFYYFMAKIFIHKGRTCRWADHFVFKIPGFSLLLDVFC 
ALHGPREKCVEILRSGHLLAISPGGVREAMSDETYNIVWGHRR 
GFAGVAIDAKVPIIPMFTQNIREGFRSLGGTRLFRWLYEKFRYP 








FAPMYGGFPVKLRTYLGDPIPYDPQITAEELAEKTKNAVQAL_D 
KHQRIPGNIMSALLERFH 


6421 




362 


WALSLRRQPERMSNKLLSPHPHSVVLRSEFKMASSPAVLRASRI, 
YQWSLKSSAQFLGSPQLRQVGQIIRVPARMAATLILEPAGRCCW
DEPVRIAVRGLAPEQPVTLRASLRDEKGALFQAHARYRADTLGE 
LDLERAPALGGSFAGLEPMGLLWALEPEKPLVRLVKRDVRTPLA 
VELEVLDGHDPDPGRLLCQTRHERYFLPPGVRREPVRVGRVRGT 
IiFLPPEPGPFPGIVDMFGTGGGLLEYRASLLAGKGFAVMALAYY 
NYEDLPKTMETLHLEYFEEAMNYLLSHPEVKGPGVGLLGISKGG 
ELCLSMASFLKGITAAWINGSVANVGGTLRYKGETLPPVGVNR 
NRIKVTKDGYADIVDVLNSPLEGPDQKSFIPVERAESTFLFLVG 
QDDHNWKSEFYANEACKRLQAHGRRKPQIICYPETGHYIEPPYF 
PLCRAS LHALVGS P 1 1 WG3E PRAHAMAQVDAW KCLQTFFHKHLG 
GREGTIPSKV 








EGENL l i'WI PGPFL? DVTKGKIFI 
WMKGATTNICYNVLDRNVHEKKLGDKVAFYWEGNEPGETTQITY 
HQLLVQVCQFSNVLRKQG1HKGDRVAIYMPMIPELWAMLACAR 
IGALHSIVFAGFSSESLCERILDSSCSLLITTDAFYRGEKLVNL 
KELADEALQKCQEKGFPVRCCIWKIILGRAELGMGDSTSQSPP1 
KRSCPDVQISWNQGIDLWWHELMQEAGDECEPEWCDAEDPLFIL
YTSGSTGKPKGWHTVGGYMLYVATTFKYVFDFHAEDVFWCTAD 
IGWITGHSYVTYGPLANGATSVLFEGIPTYPDVTJRLWSIVDKYK 
VTKFYTAPTAIRLLMKFGDEPVTKHSRASLQVLGTVGEPINPEA 
WLWYHRWGAQRCPIVDTFWQTETGGHMLTPLPGATPMKPGSAT 
FPFFGVAPAILNESGEELSGEAEGYLVFKQPWPGIMRTVYGNHE 
RFETTYFKKFPGYYVTGDGCQRDQDGYYRITGRIDDMLNVSGHL 
LSTAEVESALVEHEAVAEAAWGHPHPVKGECLYCFVTLCDGHT 
FSPKLTEELKKQIREKIGPIATPDYIQNAPGLPKTRSGKIMRRV 


6423 


614 


1237 


ANLKEIPRDLPPETVLLYLDSNQITSIPNEIFKDLHQLRVLNLS
KKGIEFIDEHAFKGVAETLQTLDLSDNRIQSVHKNAFNNLKARA 
RIANNPWHCDCTLQQVLRSMASKHETAHNVICKTSVLDEHAGRP
FLNAANDADLCNLPKKTTDYAMLVTMF3WFTMVISYWYYVRQN 
QEDARRHLEYLKSLPSRQKKADEPDDISTW 








GGEGGKGGD VWWAHNRMTL KQLKDRY PRKRFVAG VGANS K I S A 
LKGSKGKDWEI PVPVGI SVTDEMGKI 1" GELNKENDRI LVAQGGL 
GGKLLTNFLPLKGQKRIIHLDLKLIADVGLVGFPNAGKSSLLSC 
VSHAKPAIADYAFTTLKPELGKIMYSDFKQISVADLPGLIEGAH 

LLTKELELYKEELQTKPALLAVNKMDLPDAQDKFHELMSQLQNP 
KDFLHLFEKNMIPERTVEFQHIIPISAVTGEGIEELKNCIRKSL 
DEQANQENDALHKKQLLNLWISDTMSSTEPPSKHAVTTSKMDI1 


6425 


1850 


1144 


LAKEGGGGIPijETLKEESQSRHVLPASFEVNSLQKSNWGFLLTG 
LVGGTLVAVYAVATPFVTPALRKVCLPFVPATMKQIENWKMLR 
CRRGSLVDIGSGDGRIVIAAAKKGFTAVGYELNPWLVWYSRYRA 
WREGTOGSAKFYISDLWKVTFSQYSNWIFGVPQMMLQLEKKLE 
RELEDDARVIACRFPFPHWTPDHVTGEGIDTVWAYDASTFRGRE 
KRPCTSMHFQLPIQA 


6426 


30 


565 


SRGAAVGGMSVAGGEIRGDTGGEDTAAPGRFSFSPEPTLEDIRR 
LHAEFAAERDWEQFHQPRNLLLALVGEVGELAELFQWKTDGEPG 
PQGWSPRERAALQEELSDVLIYLVAIAARCRVDLPLAVLSKMDI
NRRRYPAHLARSSSRKYTELPHGAISEDQAVGPADIPCDSTGQT 
ST 


6427 


145 


959 


AASWG P PHVPKAGKMVSWM I CRLWLVFGMLCPAYAS YKAVKTK 
NIREYVRWMMYWIVFALFMAAEIVTDIFISWFPFYYEIKMAFVL 











KLLSPYTKGASLLYRKFVHPSLSRHEKEIDAYIVQAKERSYETV 
LSFGKRGLNIAASAAVQAATKSQGAIAGRLRS7SMCDLRSISDA 

VPRAPARPREKPLIRSQSLRWKRKPPVREGTSRSLKVRTRKKT 
VPSDVDS 


6428


1982 




SGSGGKMEDHQHVPIDIQTSKLLDWLVDRRHCSLKWQSLVLTIR 
EKINAAIQDMPESEEIAQLIjSGSYIHYFHCLRILDLLKGTEAST 
KNIFGRYSSQRMKDWQEIIALYEKDNTYLVELSSLLVRNWYEI 
PSLKKQIAKCQQLQQEYSRKEEECQAGAAEMREQFYHSCKQYGI 
TGENVRGELLALVKDLPSQLAEIGAAAQQSLGEAIDVYQASVGF
VCESPTEQVLPMLRFVQKRGNSTVYEWRTGTEPSWERPHLEEL 
PEQVAEDAIDWGDFGVEAVSEGTDSGISAEAAG1DWGIFPESDS 
KDPGGDGIDWGDDAVALQITVLEAGTQAPEGVARGPDALTLLEY 
TETRNQFLDELMELE1FLAQRAVELSEEADVLSVSQFQLAPAIL 
QGQTKEKMVTMVSVLEDLIGKLTSLQLQHLFMILASPRYVDRVT 
EFLQQKLKQSQLLALKKELMVQKQQEALEEQAAL3PKLDLLDEK 
TKELQKLIEADISKRYSGRPVNLMGTSL 


6429 


3413 


3442 


EPSSWTAAPRGPLAAHPLEAAVQEDDRRALSFDSRIKVFANGTL 

WKSVTDKDAGDYLCVARNKVGDDYWLKVDVVflKPAKIEHKEE 

NDHKVFYGGDLKVDCVATGLPNPEISW3LPDGSLVNSFMQSDDS 

GGRTKRYWFNNGTLYFNEVGMREEGDYTCFAENQVGKDEMRVR 

VKWTAPATIRNKTCLAVQVPYGDWTVACEAKGEPMPKVTWLS 

PTNKVIPTSSEKYQIYQDGTLLIQKAQRSDSGIJYTCLVRNSAGE ' 

DRKTVWIHVNVQPPKINGWPNPITTVREIAAGGSRKLIDCKAEG 

IPTPRVLWAFPEGWLPAPYYGNRITVHGNGSLDIRSLRKSDSV 

QLVCMARNEGGEARLIVQLTVLEPMEKPIFHDPISEKITAMAGH 

TISLNCSAAGTPTPSLVOTLPNGTDLQSGQQLQRFYHKADGMLH 

ISGLSSVDAGAYRCVARNAAGHTERLVSLKVGLKPEANKQYHNL 

VSIINGETLKLPCTPPGAGQGRFSWTLPNGMHLEGPQTLGRVSL 

ITSEPTPVIYTRPGNTVKLNCMAMGIPKADITWELPDKSHLKAG 
VQARLYGKRFLHPQGSLTIQHATQRDAGFYKCMAKNILGSDSKT


6430 


1946 


602 


RTRVSTGLRRTLLWSEAVGASSTRGDTGirGSGEGGAGPGGGEG 

YAGYIPPSKEESDWPASGSSSPLRGESAADSDGWDSAPSDLRTI 
QTFVKKAKSSKRRAAQAGPTQPGPPRSTFSRLQAPDSATLLEKM 
KIiKDSLFDLDGPKVASPLSPTSLTHTSRPPAALTPVPLSQGDLS 
HPPRKKDRKNRKLGPGAGAGFGVLRRPRPTPGDGEKRSRIKKSK 
KRKLKKAERGDHLPPPGPPQAPPSDTDSEEEEEEEEEEEEEEMA 
TWGGEAPVPVLPTPPEAPRPPATVHPEGVPPADSESKEVGSTE 
TSQDGDASSSEGEMRVMDEDIMVESGDDSWDLITCYCRKPFAGR 
PMIECSLCGTWIHLSCAKIKKTNVPDFFYCQKCKELRPEARRLG 
GPPKSGEP 


6431 


3 


605 


W NS£ itlLPAYAPYLPCEACAMQDGRKGGAYAGKMEATTAGVGR 
LEEEALRRKERLKALREKTGRKDKEEGEPKTKHLREEEEEGEKH 
RELRLRNYVPEDEDLKKRRVPQAKPVAVEEKVKEQLEAAKPEPV 
IEEVDLANLAPRKPDWDLKRDVAKKLEKLKKRTQRAIAELIRER 
LKGQEDSLASAVDAATEQKTCDSD 


6432 


56 


1692 


GGLG rMGSRIKQNPETTFEVYVEVAYPRTGGTLSDPEVQRQFPE 
DYSDQEVLQTLTKFCFPFYVDSLTVSQVGQNFTFVLTDIDSKQR 
FGFCRLSSGAKSCFCILSYLPWFEVFYKLLNILADYTTKRQENQ 
KNELLETLHKLPIPDPGVSVHLSVHSYFTVPDTRELPSIPENRN 
LTEYFVAVDVNNMLHLYASMLYERRILIICSKLSTLTACIHGSA
AMLYPMYWQHVYIPVLPPHLLDYCCAPMPYLIGIHLSLMEKVRN 
MALDDV\'ILNVDTNTLETPFDDLQSLPNDVISSLKMRLKKVSTT 








TGDGVARAFLKAQAAFFGSYRNALKIEPEEPITFCEEAFVSHYR 
SGAMRQFLQNATQLQLFKQFIDGRLDLLNSGEGFSEVFEEEINM 
GEYAGSDKLYHQWLSTVRKGSGAILNTVKTKANPAMKTVYKFDI 
AENGCAPTPERQI.PKTAPSPLVEAKDPKLREDRRPITVHFGQVR 
PPRPHWKRPKSNIAVEGRRTSVPSPEQNTIATPATLHILQKSI 
THFAAKFPTRGWTSSSH 


6433


1524




APVTKRKEVFAKDSKGSALDAGRDPKRPALPETLCESGWASNTA 
PTTPPQPGWCLCGKDFKSSCQTPGREKERRLATMHGSCSFLMLL 
LPLLLLLVATTGPVGALTDEEKRLMVELHNLYRAQVSPTASDML 
HMRWDEELAAFAKAYARQCVWGHNKERGRRGENLFAITDEGMDV 
PIAMEEWHHEREHYNLSAATCSPGQMCGHYTQVVWAKTERIGCG 
SHFCEKLQGVEETNIELLVCNYEPPGNVKGKRPYQEGTPCSQCP 
SGYHCKNSLCEPIGSPEDAQDLEYLVTEAPSFRATEASDSRKMG 
AEGPDKPSWSGLNSGPGHVWGFLLGLLLLPPLVLAGIF 


6434 


40 


2002 


MWLNFGMADPTQMGGLSMLLLAGEHALGTPEVFS j JV3E 
SPELRQKSPLFQFAEISSSTSHSDASTKQCUTSALFQFAEISSN 
TSQLGGAEPVKRCGKSALFQLAEMrf i ^RGKKMEESKLIKAKES 
DGGRIKELF.KGKEEKEIKMEKTDETRLQKEAEFEKSAKENLRDS 
KELRNFEALQIDDIMAIKMEDPKEIRKEELEEDHKCSHFPDFSY 
SASSKIIISDVPSRKDHMCHPHCIMIIEDPAAI.NKPEKLKKKKK 
KSKMDRHGNDKSTPKKTCKKRQSSESDIESVIYTIEAVAKGDWG 
IEKLGDTPRKKVRTSSSGKGSILDAKPPKKKVKSREKKKSKEKS 
SDTTKESRPPDFISISASKWISGETPEGIKAEPLTPMEDALPPR
LSGQAKPEDSDCHRKIETCGSRKSERSCKGALYKTLVSEGMLTS 
LRANVDRGKRSSGKGNSSDHEGCWNKESNTFSQSGTSGSKKFKK 
TKPKEDCLLGSAKLDERFEKKFNSLPQYSPVTFDRKCVPVPRKK 
KKTGNVSSEPTKTSKGSGDKWSNKQLFLDAIHPTEAIFSEDRNT 
MEPVHKVKNIPSIFNTPEPTTTARTFGGQPKEKSKENPDYSPCQ
DTQRAGYHHEEVLWMTNLMNNCGGVYLKQLRHTAMTNA 


6435


2227 


657 


ALQRDAAAAYAHPEYEERFLQEETVSQQINSIELLQTRPLALPE 
WKSQRPLQR2VHLRGRPASQPTVIRGITYYKAKVSEEENDIEE 
QQDEFFSGDNGVDLLIEDQLLRHNGLMTSVTRRPAATRQ3HSTA 
VTSDLNARTAPWSSALPQPSTSDPSIANHASVGPTLQTTSVSPD 
PTRESVLQPSPQVPATTVAHTATQQPAAPAPPAVSPREALMEAM 
HTVPVPPTTVRTDSLGKDAPAGRGTTPASPTLSPEEEDDIRNVI 
GRCKDTLSTITGPTTQNTYGRNEGAWMKDPLAKDERIYVTNYYY 
GNTLVEFRNLENFKQGRWSNSYKLPYSWIGTGHWYNGAFYYNR 
AFTRNIIKYDLKQRYVAAWAMLHDVAYEEATPWRWQGHSDVDFA
VDENGLWLIYPALDDEGFSQEVIVLSKLNAAD^STQKETTWRTG 
LRRNFYGNCFVICGVLYAVDSYNQRNANISYAFDTHTNTQIVPR 
LLFENEYFYTTQIDYNPKDRLLYAWDNGHQVTYHVIFAY 








GACRPPVRQDPDSGPDYEALPAGATVTTHMVAGAVAGILEHCVM 
YPIDCVKTRMQSLQPDPAARYRKVLEALWRIIRTEGLWRPMRGL 
NVTATGAGPAHALYFACYEKLKKTLSDVIHPGGNSHIANGAAGC 
- l HZAAMNPAEWKQRMQMYITSPYHRVTDCVRAVWQNEGAG 
AFYRSYTTQLTMNVPFQAIHFMTYEFLQEHFIIPQRRYNPSSHVL 
SGACAGAVAAAATTPLDVCKTLUITQESLALNSHITGHITGMAS 
AFRTVYQVGGVTAYFRGVQARVIYQIPSTAIAWSVYEFFKYLIT 
KRQEEWRAGK 


6437 


1828


360 


r-^SPARHVTRTARGHLEGGSRAPPLLQAVFLQIKNMVK 
LIHTLADHGDDVNCCAFSFSLLATCSLDKTIRLYSLRDFTELPH 
SPLKFHTYAVHCCCFSPSGHILASCSTDGTTVLWNTENGQMLAV 
MEQPSGSPVRVCQFSPDSTCLASGAADGTWLWNAQSYKLYRCG 
SVKDGS LAACAFS PNGS FFVTGSS CGDLTVWDDKI>1RCLHS E KAH 
DLC-ITCCDFSSQPVSEGEQGLQFFRLASCGQDCQVKIWIVSFTH 
ILGFELKYKSTLSGHCAPVLACAFSHDGQMLVSGSVDKSVIVYD 











TNTEyiLHTLTQHTRYVTTCAFAPNTLLLATGSMDKTVNIWQFD 
LETLCQARSTEHQLKQFTEDWSEEDVSTKLCAQDLKDLVGIFKM 
NNIDGKELLMLTKESLADDLKIESLGLRSKVLRKIEELRTKVKS

RTSPP 


6438 


109 


901 


EVQILRAKMFQTGGLIVFYGLLAQTMAQFGGLPVPLDQTLPLNV 
NPALPLSPTGLAGSLTNALSNGLLSGGLLGILENLPLLDILKPG 
GGTSGGLLGGLLGKVTSVIPGLNNIIDIKVTDPQLLELGLVQSP 
DGHRLYVTIPLGIKLQVNTPLVGASLLRLAVKLDITAEILAVRD
KQER IHLVLGDCTHS PGS LQI SLLDGL3PLP IQGLLDSLTG ILN 
KVLPELVQGNVCPLVNEVLRGLDITLVHDIVNMLIHGLQFVIKV 


6439 


23 


412 


SIQTASAITTEMASQSQGIQQLLQAEKRAAEKVADARKRKARRL 
RRQVQGMQSSQQRNRERVLAQLLGMVCDVRPQVHPNYRISA 


6440 


3 


517 


RAI WNSDMGDLPGLVRLS IALRIQPND3PVFYKVDG0RFGQNRT 
IKLLTGSSYKVEVKIKPSTLQVENISIGGVLVPLELKSKEPDGD 
RWYTGT YDTEGVTPTKSGERQP I Q ITMPFTDIGTFETVWQVKF 
YNYHKRDHCQWGSPFSVIEYECKPNETRSLMWVNKESFL 








^ r r r i r c r i r ^ T 

RNVCKENSTVGMKIQEELQRSGGLDHLVLSPGEWPVSDNTIMHI 
ATAEALTTDYWCLDDLYREMVRCYVEIVEKLPERRPDPATIEGC 
AQLKPNNYLLAWHTPFNEKGSGFGAATKAMCIGIiRYWKPERLEr 
LIEVSVECGRMTHNHPTGFLGSLCTALFVSFAAQGKPLVQWGRD 
MLRAVP^AEEYCRKTIRIITAEYQEHWFYFEAKWQFYT^EERKISK 
DSENKAIFPDNYDAEEREKTYRKWSSEGR3GRRGHDAPMIAYDA 
LLAAGNSWTELCHRAMFHGGESAATGTIAGCLFGLLYGLDLVPK 
GLYQDLEDKEKLEDLGAALYRLSTEEK 








LQRQSLLDMSLVKLQLCHMLVEPNLCRSVLIANTVRQIQEEMTQ 
DGTWRTVAPQAAERAPLDRLVSTEILCRAWGQEGAHPASGLGD 
GHTQGPVSDLCPVTSAQAPRHLQSSAWEMDGPRENRGSFHKSLD 
QIFETLETKNPSCMEELFSDVDSPYYDLDTVLTGMMGGARPGPC 
EGLEGLAPATPGPSSSCKSDLGELDHWEILVET 


6443 






AVPIPEKMSEWAPRPPPEFVRDVMGSSAGAGSGEFHVYRHLRRR
EYQRQDYMDAMAEKQKLDAEFQKRLEKNKIAAEEQTAKRRKKRQ 
KLKEKKliLAKKMKLEQKKQEGPGQPKEQGSSSSAEASGTEEEEE 
VPSFTMGR 


6444 


390 


899 


GSTPRGKMRAPIPEPKPGDLIEIFRP FYRHWAI YVGDGY WHLA 
PPSEVAGAGAASVMSALTDKAIVKKELLYDVAGSDKYQVNNKHD 
DKYSPLPCSKIIQRAEELVGQEVLYKLTSENCEHFVNELRYGVA 
RSDQVRDVIIAASVAGMGLAAMSLIGVMFSRNKRQKQ 


6445 


2 


753 


AGAAGAAGAARSPRPQAHTKGVRGLPSRRRSPDCGRMELAAGSF
SEEQFWEACAELQQPALAGADWQLLVETSGISIYRLLDKKIGLY 
E -CYLEDCSPTLLADIYMDSDYRKQWDQYVKELYEQECNGE 
TWYWEVKYPFPMSNRDYVYIiRQRRDLDMEGRKIHVILARSTSM 
PQLG3RSGVIRVKQYKQSLAIESDGKKGSKVFMYYFDNPGGQIP 
SWL I N W AA KNGVPNFLKDMARACQNYLKKT 


6446 


1 


1651 


RCPTRSPPPDTPGSRGTTAMCSLASGATGGRGAVENEEDLPELS 
DSGDEAAWEDEDDADLPHGKQQTPCLFCNRLFTSAEETFSHCKS 
EHQFNIDSMVHKHGLEFYGYIKLINFIRLKFPTVEYMNSIYNPV 
E -EYLKPVLEDDLLLQFDVEDLYEPVSVPFSYPKGLSENTS 
WEKLKHMEARALSAEAALARAREDLQKMKQFAQDFVMHTDVRT 
CSSSTSVIADLQEDEDGVYFSSYGHYGIHEEMLKDKIRTESYRD 
FIYQNPK I FKDKWLDVGCGTGILSMFAAKAGAKKVLGTOQSEI 








LYQAMDI IRLNKLEDTITLIKGKIEETOLPVEKVDVIISEKMGY 
FLLFESMLDSVLYAKMKYLAKGGSVYPDICTISLVAVSDVWKHA 
DRIAFWDDVYGFKMSCMKKAVIPEAWEVLDPKTLISEPCGIKH 
IDCHTTSISDLEFSSDFTLKITRTSMCTAIAGYFDIYFEKNCHN 
RWFSTGPQSTKTHWKQTVFLLEKPFSVKAGEALKGKVTVHKNK 
KDPRSLTVTLTLNNSTQTYGLQ 


6447


1554 


1068 


RLGPAEWHLSGPCHATLGAANRGRALGVRAAWRGAPLCQRVMMP 
SRTNLATGIPSSKVKYSRLSSTDDGYIDLQFKKTPPKIPYKAIA 
LATVLFLIGAFLIIIGSLLLSGYISKGGADRAVPVLIIGILVFL 
PGFYHLRIAYYASKGYRGYSYDDIPDFDD


6448 


74 


558


GQVLSHCYHYRSSRWRRGGLSRGRGAGVMALVPYEET.EFGLQK 
FHKPLATFS FANHT I Q I RQDWRHLGVAAWWDAAIVLSTYLEMG 
AVELRGRSAVELGAGTGLVGIVAALLP.CP.IRYERDNWFLAMLER 
QFIVRKVHYDPEIDYHIYEAQKRNQKEDL








L ' ll LE IT Rj' A<_,LKF H I LSI ILltJjj 
GLLNWVDGLFIIGWMYLPPHDPHVDEPMRFKPLFRIHLMERKA 
ATVECMYGHKGPHHGHIQIVKKDEFSTKCNQTDHHRMSGGRQEE 
FRTWLREEWGRTLED 1 FHEHMQELI LMKFI YTSQYDNCLTYRRI 
YLPPSRPDDLIKPGLFKGTYGSHGLEIVMLSFHGRRARGTKITG 
D PII I PAGQQTVE IDLRHRI QLPDLENQRNFNELS RIVLEVRERV 

EDGGEPGDAVAAAEQPAQCGQGQPFVLPVGVSSRNEDYPRTCRM 
CFYGTGLIAGHGFTSPERTPGVFILFDEDRFGFVrtLELKSFSLY 
SRVQATFRNADAPSPQAFDEMLKWIQSLTS 


6450 


848 


269 


FVPAPRTVSGKRSLPGEWEERGEGEQRTGREFSGNGGRAVEAAR 

VLGLAGNSFRPEHRALLNAFTATFELSDDGRFEVWNAKTRGQHC 
DTWSYVLIPAAQPGOFTVDHRVivTi EQ 3 ' j] "->GrELVAAS 
RDAGPVHLPGQSSGPLG 


6451 


232 


939 


HSPTPPTSPRASTMEDVKLEFPSLPQCKEDAEEWTYPMRREMQE 

FQQLFRYLVLDIADNPVEKIIRFFPMTKEFIDGSLQMGGKVLVH 
GNAGISRSAAFVIAYIMETFGMKYRDAFAYVQERRFCINPNAGF 
VHQLQEYEAIYLAKLTIQMMSPLQIERSLSVHSGTTGSLKRTH3 
EEDDFGTMQVATAQNG 


6452






LKFLKPKINSFYAFEVKDAKGRTvSLEKYKGKVSLWNVASDCQ 
LTDRNYLGLKELHKEFGPSHFSVLAFPCNQFGESEPRPSKEVES 
FARKNYGVTFPIFHKIKILGSEGEPAFRFLVDSSKKEPRWNFWK 
YLVNPEGQWKFWRPEEPIEVIRPDIAALVRQVIIKKKEDL 


6453 




223 


HRRNLPGLSMSPRRTLPRPLSLCLSLCLCLCLAAAT.GSAQSGSC 
RDKKNCKWFSQQELRKRLTPLQYHVTQEKGTESAFEGEYTHHK 
DPGIYKCWCGTPLFKSETKFDSGSGWPSFHDVINSEAITFTDD 
FSYGMHRVETSCSQCGAHLGHIFDDGPRPTGKRYCINSAAL3FT 
PADSSGTASGGSGVASPAQADKAEL


6454 


827 


223 


HRRWLPGL3MSPRRTLPRPLSLCLSLCLCLCLAAALGSAQSGSC 
RDKKNCKWFSQQELRKRLTPLQYHVTQEKGTESAFEGEYTEHK 
DPGIYKCWCGTPLFKSETKFDSGSGWPSFHDVINSEAITFTDD 
FSYGMHRVETSCSQCGAHLGHIFDDGPRPTGKRYCINSAALSFT 
PADSSGTAEGGSGVASPAQADKAEL 


6455 


1042 


173 


■' > H " TVSASAAWDALGLPVRSHMQGSTRRMGVMTDVHRRFLQL 
U ITH : VLEEWDVKRLQTHCYKVHDRNATVDKLEDFINNIN3VLE 
- r IKRGVTEDDGRPI YALVNLATTS ISKMATDFAENELDLF 
I " ^b_i L 1 1 DSETGFASS TNI LNLVDQLKGKKMRKKEAEQVLQKF 
VQNKWLIEKEGEFTLHGRAILEMEQYIRETYPDAVKICKICHSL 








LIQGQSCETCGIRMHLPCVAKYFQSNAEPRCPHCNDYWPHEIPK 
VFDPEKERESGVLKSNKKSLRSRQH 


6456 


• 


555


RPQSRSISMWRNSLLQVSSGLRWLRVCAMVDILGERHLVTCKGA 
TVEAEAALQNKWALYFAAARCAPSRDFTPLLCDFYTALVAEAR 
RPAPFEWFVSADGSSQEMLDFMRELHGAWLALPFHDPYRHELR 
KRYNVTAIPKLVIWQNGEVITNKGRKQIRERGLACFQDWVEAA 


6457 


23 


892 


PTTGFPVTNFPWNWPDGKPPIMILYVSKLNKIIHFPDFDKKIPV 
KLFPLPLLYVGNHISGLS3TSKLSLPMFTVLRKFTIPLTLLLET 
IILGKQYSLNIILSVFAIILGAFIAAGSDLAFNLEGYIFVFLND 
I FTAANGVYTKQKMDPKELGKYGVLFYNACFMI IPTL J ISVSTG 
DLQQATEFNQWKNWFILQFLLSCFLGFLLMYSTVLCSYYNSAL 
TTAWGAIKNVSVAYIGILIGGDYIFSLLNFVGLNICMAGGLRY
SFLTLSSQLKPKPVGEENICLDLKS 


6458


23 


832 


PTTGFPVTNFPWNWPDGKPPIMILYVSKLNKIIHFPDFDKKIPV 
KLFPLPLLYVGNHISGLSSTSKLSLPMFTVLRKFTIPLTLLLET 
I ILGKQYSL1TI ILSVFAI ILGAFIAAGSDLAFNLEGYI FVFLND 
IFTAANGVYTKQKMDPKELGKYGVLFYNACFMIIPTLIISVSTG
DLQQATEFNQWKNWFILQFLLSCFLGFLLMYSTVLCSYYNSAL 
TTAWGAIKNVSVAYIGILIGGDYIFSLLNFVGLNICMAGGLRY
SFLTLSSQLKPKPVGEENICLDLKS 


6459 


23 




PTTGFPVTNFPWNWPDGKPPIMILYVSKLNKIIHFPDFDKKIPV 
KLFPLPLLYVGNHISGLSSTSKLSLPMFTVLRKFTIPLTLLLET 
IILGKQYSLNIILSVFAIILGAFIAAGSDLAFNLEGYIFVFLND
IFTAANGVYTKQKMDPKELGKYGVLFYNACFMIIPTLIISVSTG
DLQQATEFNQWKNWFILQFLLSCFLGFLLMYSTVLCSYYNSAL 
TTAWGAIKNVSVAYIGILIGGDYIFSLLNFVGLNICKAGGLRY
SFLTLSSQLKPKPVGEENICLDLKS 


6460


23 


892


PTTGFPVTNFPWNWPDGKPPIMILYVSKLNKIIHJPDFDKKIPV 
KLFPLPLLYVGNHISGLSSTSKLSLPMFTVLRKFTIPLTLLLET 
IILGKQYSLNIILSVFAIILGAFIAAGSDLAFNLEGYIFVFZjND 
IFTAANGVYTKQKMDPKELGKYGVLFYNACFMIIPTLIISVSTG 
DLQQATE FNQWKNWFI LQFLLSCFLG JLLMYS TVLCSYYNSAL 
TTAWGAIKNVSVAYIGILIGGDYIFSLLNFVGLNICMAGGLRY 
SFLTLSSQLKPKPVGEENICLDLKS 


6461 


1653 


360 


LQQRTLRITAVGQTHPIAWMAWEPSLGAFYGPASFITFVNCMYF 
LSIFIQLKRHPERKYELKEPTEEQQRLAANENGEINgQDSMSLS 
LISTSALENEHTFHSQLLGASLTLLLYVALWMFGALAVSLYYPL 
DLVFS FVFGATS LS FSAFFWHHCVNREDVRLAW IMTCCPGRS S 

GCKLTNLQAAAAQCHANSLPLNSTPQLDNSLTEHSMDNDIKMHV 
APLEVQFRTNVHSSRHHIQJRSKGHRASRLTVLREYAYDVPTSVE 
GSVQNGLPKSRLGNNEGHSRSRRAYLAYRERQYNPPQQDSSDAC 
STLPKSSRNFEKPVSTTSKKDALRKPAWELENQQKSYGLNLAI 
QNGPIKSNGQEGPLLGTDSTGNVRTGLWKHETTV 


6462 




773 


SE3LDREXKLKEDSPRKTPNKESGVPSLPVSLTSIKEEPKEAKH 
PDSQSMEESKLKNDDRKTPVNWKDSRGTRVAVSSPMSQHQSYIQ 
YLHAYPYPQMYDPSHPAYRAVSPVLMHSYPGAYLSPGFHYPVYG
KMSGREETEKVNTSPSVNTKTTTESKALDLLQQHANQYRSKSPA 
PVEKATAEREREAERERDRHSPFGQRHLHTHHHTHVGMGYPLIP 
GQYDPFQGLTSAALVASQQVAAQASASGMFPGQRRE 


6463 


2 


350 


VILCILGGWIFKNADRSMEKKKGEPRTRAEARPWVDEDLKDSSD
LHQAEEDADEWQESEENVEHIPFSHNHYPEKEMVKRSQEFYELL 
NKRRSVRFISNEQVPMEVIDNVIRTAGL 


6464 


12 




G I LRQKERE ERNRIHKKE I LFLEHLLWPSEMSS LSGKVQTVLG 








DVE ^TTGRTLTHEHLAMTFDCCYCPPPPCQEAISKEPIVMIOfL 
YWIQKNAYSHKENLQLNQETEAIKEELLYFKANGGGALVENTTT 
GISRDTQTLKRLAEETGVHIISGAGFYVDATHSSETRAMSVEQL 
TDVLMNEILHGADGTSIKCGIIGEIGCSWPLTESERKVLQATAH 
AQAQLGCPVIIHPGRSSRAPFQIIRILQEAGADISKTVMSHLDR
T 1 LDKKELLE FAQLGC YLEYDLFGTELLHYQLGPD I DMPDDNKR 

PKMLLRGITENVLDKILIENPKQWLTFK


6465


126 


1396


KMTVFFKTLRNHWKKTTAGLCLLTWGGHWLYGKHCDJILLRRAAC 
QEAQVFGNQLIPPNAQVKKATVFLNPAACKGKARTLFEKNAAPI
LHLSGMDVTIVKTDYEGQAKKLLELMENTDVIIVAGGDGTLQEV
VTGVLRRTDEATFSKIPIGFIPLGETSSLSHTLFAESGNKVQHI 
TDATLAIVKGETVPLDVLQIKGEKEQPVFAMTGLRWGSFRDAGV 
KVSKYWYLEPLKIKAAHFFSTLKEWPQTHQASISYTGPTERPPN 
EPEETPVQRPSLYRRILRRLASYWAQPQDALSQEVSPEVWKDVQ 
LSTIELSITTRNNQLDPTSKEDFLNICIEPDTISKGDFITIGSR 
KVRNPKLHVEGTECLQASQCTLLIPEGAGGSFSIDSEEYEAMFV 
EVRLLPRKLQFFCDPRKR3QMLTSPTQ 


6466 


1134 


828 


VARGTELSQLEKAHPPADMGRRKSKRKPPPKKKM'i'GTLETQFTC 
PPCNHEKSCDVKMDRARNTGVISCTVCLEEFQTPITYLSEPVDV
YSDWIDACEAANQ 




301 


2571 


GEI.RVIAIAHGEIACHAVl.TASLLSLRSRLMDSDMDYERPNVKT 
IKCVWGDNAVGKTRLICARACNATLTQYQLLATHVPTVWAIDQ 
YRVCQEVLERSRDWDDVSVSLRLWDTFGDHHKDRRFAYGR3DV 
WLCFSIANPNSLjHHVKTMWYPEIKI-IFCPRAPVILVGCQLDLRY 
ADLEAVNRARRPLARPIKPNEILPPEKGREVAKELGIPYYETSV 
VAQFGI KDVFDNAI RAAL I SRRHLQFWKSHLRNVQRPLLQAP fl 
PPKPPPPIIWPDPPSSSEECPAHLLEDPLCADVILVLQERVRI 
FAHKIYLSTSSSKFYDLFLMDLSEGELGGPSEPGGTHPEDHQGH 
SDQHHHHHHHHHGKDFLLRAASFDVCESVDEAGGSGPAGLRAST 
SDGILRGNGTGYLPGRGRVLSSWSRAFVSIQEEMAEDPLTYKSR
LMVWKMDSSIQPGPFRAVLKYLYTGELDENERDLMHIAHIAEL 
LEVFDLRMMVANILNNEAFMNQEITKAFHVRRTNRVtCECLAKGT 
FSDVTFILDDGTISAHKPLLISSCDWMAAiMFGGPFVESSTREW 
FPYTSKSCMRAVLEYLYTGMFTSSPDLDDMKLIILANRLCLPHL 

CLHHICTNYNNVCRKFPRDMKAMSPENQEYFEKHRWPPVWYLKE 
EDHYQRARKEREKEDYLHLKRQPKRRWLFWNSPSSPSSSAASSS 
SPSSSSAW 


6468 


3 


1374


AALAPVGSPASRGPRLAAGLRLLPMLGLLQLLAEPG 
LGRVHHLALKDDVRHKVHLNTFGFFKDGYMWKVSSLSLKEPED 
KDVT1GFSLDRTKNDGFSSYLDEDVNYCILKKQSVSVTLLILDI 
SRSEVRVKSPPEAGTQLPKIIFSRDEKVLGQSQEPNVNPASAGN 
QTQKTQDGGKSKRSTVDSKAMGEKSFSVHNNGGAVSFQFFFNIS 
TDDOEGLYSLYFHKCLGKELPSDKFTFSLDIEITEKWPDSYLSA 
GEIPLPKLYISMAFFFFLSGTIWIHILRKRRNDVFKIHWLMAAL 
PFTKSLSLVFHAIDYHYISSQGFPIEGMAWYYITHLLKGALL? 
ITIALIGTGWAFIKHILSDKDKKIFMIVIPRRVLAWi/AYIIIES 
TEEGTTEYGLWKDSLFLVDLLCCGAILFPWWSIRHLQEASATD 
GKGKFSRAHFVLLSLL 


6469 


3 


1374 


D R TNtlAALAPVGSPASRGPRIAAGLRLLPMLGLLQLLAEPG 
LGRVHHLALKDDVRHKVHLNTFGFFKDGYMWKVSSLSLNEPED 
KDVTIGFSLDRTKNDGFSSYLDEDVNYCILKKQSVSVTLLILDI 
SRSEVRVKSPPEAGTQLPKIIFSRDEKVLGQSQEPNVNPASAGN 
QTQKTQDGGKSKRSTVDSKAMGEKSFSVHNNGGAVSFQFFFNIS
rDDQEGLYSLYFHKCLGKELPSDKFTFSLDIEITEKMPDSYLSA 








GEIPLPKLYISMAFFFFLSGTIWIHI r P P'RRNDVFKIHWIjMAAL 
PFTKSLSLVFHAIDYHYISSQGFPIEGWAWYYITHLLKGALLF 
I T I AL I GTGWAF I KH I LSDKDKK I FM I VI PRRVLANVAY HIES 
TEEGTTEYGLWKDSLFLVDLLCCGAILFPWWSIRHLQEASATD 
GKGKFSRAHFVLLSLL 


6470


2726 


1437 


AAASGVSSRADAPVLAQSPASAGNGRPSTPRVPGSRRHPSAPRS 
GPLPREDGCRTPGPQLLPLPGALLRPRTLLSSAAETGRSRHPDT 
QHPSSGGRCRGGTESPSSAAGRPASMAEAEEDCHSDTVRADDDE 
ENESPAETDLQAOLQMFRAQWMFELAPGVSSSNLEKRPCRAARG 
SLQKTSADTKGKQEQAKEEKARELFLKAVEEEQNGALYEAIKFY 
RRAMQLVPDIEFKITYTRSPDGDGVGNSYIEDNDDDSKMADLLS 

SDLDLRSLEQLSLVCRGFYICARDPSIWRLACLKVKGRSCIKLV 
PYTSWREMFLERPRVRFDGVYISKTTYIRQGEQSLEGFYRAWHQ 
VEYYRYIRFFPDGHVMMLTTPEEPQSIVPRLRTR 


6471 


1750 


299 


FFFDKMAAGGSGVGGKRSSKSDADSGFLGLRPTSVDPALRRRRR 
GPRNKKRGWRRLAQEPLGLEVDQFIiEDVRLQERTSGGLLSEAPN 
EKLFPVDTGSKEKGLTKKRTKVQKKSLLLKKPLRVDLILENTSK 
VPAPKDVLAHQVPNAKKLRRKEQLWEKL^KQGELPREVRRAQAR 
LLNPSATRAKPGPQDTVERPFYDLWASDNPLDHPLVGQDEFFLE 
QTKKKGVKRPARLHTKPSQAPAVEVAPAGASYNPSFEDHQTLLS 
AAHEVELQRQKEAEKLERQLALPATEQAATQESTFQELCEGLLE 
ESDGEGEPGQGEGPEAGDAEWrPTPARLATTEKKTEQQRRREKA 
VHRLRVQQAALRAARLRHQELFRLRGIECAQVALRLAELARRQRR 
RQARREAEADKPRRLGRLKYQAPDIDVQLSSELTDSLRTLKPEG 
NILRDRFKSFQRRfWj I h t l> I IVKLVEKRAFREIQIj 


6472 


3 


897 


SCGSDRAQWAMEFPFDVDALFPERITVLDQHLRPPARRPGTTTP 
ARVDLQQQIMTIIDELGKASAKAQNLSAPITSASRMQSNRHWY 
ILKDSSARPAGKGAIIGFIKVGYKKLFVLDDREAHNEVEPLCIL 
DFYIHESVQRHGHGRELFQYMLQKERVEPHQLAIDRPSQKLLKF 
LWKHYNLETTVPQVNNFVIFEGFFAHQHRPPAPSLRATRHSRAA 
AVDPTPAAPARKLPPKRAEGDIKPYSSSDREFLKVAVEPPWPKSf 
RAPRRATPPAHPPPRSSSLGNSPERGPLRPFVP 


6473 


22 


912 


SSAVEFVWEGEPMAAEPNKTEIQTLFKRLRAVPTWKACFDCGAK 
NPSWASITYGVFLCIDCSCVHRSLGTOLSFIRSTELDSNWNWFQ 
LRCMQVGGNANATAFFRQIIGCTANDANTKYNSRAAQMYREKIRQ 
LGSAALARHGTDLWIDNMSSAVPNHSPEKKDSDFFTEHTQPPAW 
DAPATEPSGTQQPAPSTESSGLAQPEHGPNTDLLGTSPKASLEL 
KSSIIGKKKPAAAKKGLGAKKGLGAQKVSSQSFSEIERQAQVAE
KLREQQAADAKKQAEESMVASMRLAYQELQIDR 


6474 




462 


LQRQRQHPAAAPAVPVRCFTFCFTDIVIMPKRKSPENTEGKDGS 
KVTKQEPTRRSARLSAKPAPPKPEPKPRKTSAKKEPGAKISRGA 
KGKKEEKQEAGKEGTAPSENGETKAEEIHISRSTVNVSTSRGTP 
PSTLSVKGQIETVRVKGTEN 


6475 


3 


462 


LQRQRQHPAAAPAVPVRCFTFCFTDIVIMPKRKSPENTEGKDGS 
KVTKQEPTRRSARLSAKPAPPKPEPKPRKTSAKKEPGAKISRGA 
KGKKEEKQEAGKEGTAPSENGETKAEEIHISRSTVNVSTSRGTP 
PSTLSVKGQIETVRVKGTEN


6476 


106 


1090 


ARAMAQYKGTMREAGRAMHLLKKRERQREQMEVLKQRIAEETIL 
KSQVDKRFSAHYDAVEAELKSSTVGLVTLHDMKARQEALVRERE 
RQLAKRQHLEEQRLQQERQREQEQRRERKRKISCLSFALDDLDD 
QADAAEARRAGNLGKNPDVDTSFLPDRDREEEENRLREELRQEW 
EAQREKVKDEEMEVTFSYWDGSGHRRTVRVRKGNTVQQFLKKAL 
QGLRKDFLELRSAGVEQLMFIKEDLILPHYHTFYDFIIARARGK
SGPLFSFDVHDDVRLLSDATMEKDESHAGKWLRSWYEKNKHIF 
PASRWEAYDPEKKWDKYTIR 


6477


227 


915 


LQGHLMGIMAASRPLSRFWEWGKNIVCVGRHYADHVREMRSAVL 
SEPVLFLKPSTAYAPEGSPILMPAYTRNLHHELELGVVMGKRCR 
AVPEAAAMDYVGGYALCI.DMTARDVQDECKKKGLPWTLAKSFTA 
SCPVSAFVPKEKIPDPHKLKLWLKVNGELRQEGETSSMIFSIPY 
IISYVSKIITLEEGDIILTGTPKGVGPVKENDEIEAGIHGLVSM 
TFKVEKPEY 


6478




1495 


FVSSRILPESLASSEASTLEAMGRK3EDDCSSNKKQTTNIRKTF 
IFMEVLGSGAFSEVFLVKQRLTGKLFALKCIKKSPAFRDSSLEN 
EIAVLKKIKHENIVTLEDIYESTTHYYLVMQLVS3GELFDRILE 
RGVYTEKDASLVIQQVLSAVKYLHENGIVHRDLKPENLLYLTPE 
ENSKIMITDFGLSKMEQNGIMSTACGTPGYVAPEVLAQKPYSKA 
VDCWSIGVITYILLCGYPPFYEETESKLFEKIKEGYYEFESPFW 
DDISESAKDFICHLLEKDPNBRYTCEKALSHPWIDGNTALHRDI
YPSVSLQIQKNFAKSKWRQAFNAAAVVHHMRKLHMNLHSPGVRP 

QHGRRPTAPGGRSLNCLWGSLHISSSLVPMHQGSLAAGPCGCC 
SSCLNIGSKGKSSYCSUPTTJ.KKANKKQNFKSEVMVPVKASGSS 
HCRAGQTGVCLIM 


6479 


3 


949 


R( 1 IHPAGGQAGAMELLSALSLGELALSFSRVPLFPVFDLS 
YFIVSILYLKYEPGAVELSRRHPIASWLCAMLHCFGSYILADLL 
LGEPLIDYFSNNSSILLASAVWYLIFFCPLDLFYKCVCFLPVKL 
IFVAMKEWRVRKIAVGIHHAHHHYHHGWFVMIATGWVKGSGVA 
LMSNFEQLLRGVWKPETNEILHMSFPTKASLYGAILFTLQi vrp.-J 
LPVSKASLIFIFTLFMVSCKVFLTATHSHSSPFDALEGYICPVL 
FGSACGGDHHHDNHGGSHSGGGPGAQHSAMPAKSKEELSEGSRK 
KKAKKAD 


6480 


192 


514 


DFMSIYFPIHCPDYLRSAKMTEVMMNTQPMEEIGLSPRKDGLSY 
QIFPDPSDFDRCCKLKDRLPSIWEPTEGEVESGELRWPPEEFL 
VQEDEQDNCEETAKENKEQ 


6481 


110 


1131 


KSRMDLDWNMFVIAGGTLAIPILAFVASFLLWPSALIRIYYWY
WRRTLGMQVRYVHHEDYQFCYSFRGRPGHKPSILMLHGFSAHKD 
MWLSWKFLPKNLHLVCVDMPGHEGTTRSSLDDLSIDGQVKRIH 
QFVECLKLNKKPFHLVGTSMGGQVAGVYAAYYESDVSSLWLVCP 
AGLQYSTDNQFVQRLKELQGSAAVEKIPLIPSTPEEMSEMLQLC 
SYVRFKVPQQILQCLVDVRIPHNNFYRKLFLEIVSEKSRYSLHQ 
NMDKIKVPTQIIWGKQDQVLDVSGADMLAKSIANCQVELLENCG 
HSWMERPRKTAKLIIDFLASVHNTDNNKKLD


6482






ALALSRVELHKNPEKEPYKSKYSARALLEEVKALLGPAPEDEDE 
RPEAEDGPGAGDHALGLPAEWEPEGPVAQRAVRLAVIEFHLGV 
NHIDTEELSAGEEHLVKCLRLLRRYRLSHDCISLCIQAQNNLGI 
LNSEREEIETAQAYLESSEALYNQYMKEVGSPPLDPTERFLPEE 
EKLTEQERSKRFEKVYTHNLYYLAQVYQHLEMFEKAAHYCHSTL 
KRQLEHNAYHPIEWAINAATLSQFYINKLCFMEARHCLSAANVI 
FGQTGKISATEDTPEAEGEVPELYHQRKGEIARCWIKYCLTLMQ
NAQLSMQDNIGELDLDKQSELRALRKKELDEEESIRKKAVQFGT 
GELCDAISAVEEKVSYLRPLDFEEARELFLLGQHYVFEAKEFFQ 
IDGYVTDHIEWQDHSALFKGLAFFETDMERRCKMHKRRIAMLE 
PLTVDLKPQYYLLVNRQIQFEIAHAYYDMMDLKVAIADRLRDPD 
SHIVKKINNLNKSALKYYQIiFLDSLRDPKKVFPEHIGEDVLRPA 
HLAKFRVARLYGKIITADPKKELEKLATSIiEHYKFlVDYCEKHP 
EAAQEIEVELELSKEMVSLLPTKMERFRTKMALT 


6483 


3 


623 


NSHLLiCGLRARAPLSANGREARAMEQRLAEFRAARKRAGIiAAQP 
PAASQGAQTPGEKAEAAATLKAAPGWLKRFLVWKPRPASARAQP 
GLVQEAAQPQGSTSETPWNTAIPLPSCWDQSFLTNITFLKVLLW 
LVLLGLFVELEFGLAYFVLSLFYWMYVGTRGPEEKKEGEKSAYS 
VFNP3CEAIQGTLTAEQLERELQLRPLAGR 


6484 


201 


965 


QLAVKTKMSGLRPGTQVDPEIELFVKAGSDGESIGNCPFCQRLF
MILWLKGVKFNVTTVDMTRKPEELKDLAPGTNPPFLVYNKELKT
DFIKIEEFLEQTLAPPRYPHLSPKYKESFDVGCNLFAKFSAYIK
NTQKEANKHFEKSLLKEFKRLDDYLNTPLLDEIDPDSAEEPPVS
RRLFLDGDQLTLADCSLLPKLNIIKVAAKKYRDFDIPAEFSGVW
RYLHMAYAREEFTHTCPEDKEIENTYANVAKQKS 


6485 


6 


1091 


FVDLVRAVEFLPCPDSQKLEKECQSSEESMGSNSMRSILEEDEE 
DEEPPRVLLYHEPRSFEVGMLWIHKHKKYPFWPAWKSVRQRDK 
KASVLYIEGHMNPKMKGFTVSLKSLKHFDCKEKQTLLNQAREDF 
NQDIGWCVSLITDYRVRLGCGSFAG3FLEYYAADISYPVRKSIQ. 
QDVLGTKLPQLSKGSPEEPWGCPLGQRQPCRKMLPDRSRAARD 
RANQXLVEYIGKAKGAESHLRAILKSRKPSRWLQTFLSSSQYVT
CVETYLEDEGQLDLWKYLQ3VYQEVGAKVLQRTNGDRIRFILD 
VLLPEAIICAISAGDEVDYKTAEEKYIKGPSLSYREKEIFDNQL 
LEERNRRRR 




10 


581 


LVLQAGGAHLSPSRVTQGIYYMLAFSEMPKPPDYSELSDSLTLA 
GGTGRFSGPLIIRAWRMTOTFRQRMGWIGVGLYLLASAAAFYYVFE 
ISETYNRLALEHIQQHPEEPLEGTTWTHSLKAQLLSLPFWVHTV 
IFLVPYLQMFLFLYSCTRADPKTVGYCIIPICLAVICNRHQAFV 
KASNQISRLQLIDT 


6487 


352 


863 


SFLKPLRGKMSVTLHTDVGDIKIEVFCERTPKTCENF1ALCASN 
YYNGCIFHRNIKGFMVQTHDPTGTGRGGNSIWGXKFEDEYSEYL 
KHNVRGWSMANNGPNTNGSQFFITYGKQPHLDMKYTVFGKVID 
GLETLDELEKLPVNEKTYRPLNDVHIKDITIHANPFAQ 


6488 


878 


241 


TALQEFGTSGPPLSLRFALPSGTGRFKPLPGARGPSWPPSPRVP 
MEPPNLYPVKLYVYDLSKGLARRLSPIMLGKQLEGIWHTSIWH 
KDEFFFGSGGISSCPPGGTLLGPPDSWDVGSTEVTEEIFLEYL
SSLGESLFRGEAYWLFEHNCNTFSHEVAQFLTGKKIPSYITaLP 
SEVLSTPFGQALRPLLDSIQIQPPGGSSVGRPNGQS 


6489 


1457 


375 


KVAKMATALSEEELDNEDYYSLLNVRREASSEELKAAYRRLCML 
YHPDKHRDPELKSQAERLFNLVHQAYEVLSDPQTRAIYDIYGKR 
GLEMEGWEWERRRTPAEIREEFERLQREREERRLQQRTNPKGT 
ISVGVDATDLFDRYDEEYEDVSGSSFPQIEINKMHISQSIEAPL
TATDTAILSGSLSTQNGNGGGSINFALRRVTSAKGWGELEFGAG 
DLQGPLFGLKLFRNLTPRCFVTTNCALQFSSRGIRPGLTTVLAR 
NLDKNTVGYLQWHCSSPLLQVQRPHRNTRACAPEPSFRPFLHVP 
TWDAECSGARTPSTAWTSAAVKLREACLSGPGSGSHQLLLLTPR 
SKRRTGGG 


6490 


3 


1183 


IE [ JEV'\"GYGPRAZ«AAAATVLFGGAGPTETMFVARSIAADH 
KDLIHDVSFDFHGRRMATCSSDQSVKVWDKSESGDWHCTASWKT 
33VWR7TWAHPEFGQVLASCSFDRTAAWEEIVGESNDKLRG 
QSHWVKRTTLVDSRTSVTDVKFAPKHMGLMLATCSADGIVRIYE
APDVMNLSQWSLQHEISCKLSCSCISWNPSSSRAHSPMIAVGSD 
DSSPNAMAKVQIFEYNENTRKYAKAETLMTVTDPVHDIAFAPNL 
GRSFHILAIATKDVRIFTLKPVRKELTSSGGPTKFEIHIVAQFD 
NHNSQWRVSWNITGTVLASSGDDGCVRLWKANYMDNWKCTGIL
) _ _ VNGSSQQGTGNPSLGSNIPSLQKSLNGSSAGRKHS 


6491 


3 


1183 


HE - -nVWLGYGPRAAAAAAATVLFGGAGPTETMFVARSIAADH 
KDLIKDVSFDFHGRRMATCSSDQSVKVWDKSESGDWHCTASWKT 
HSGSVWRVTWAHPEFGQVLASCSFDRTAAVWEEIVGESNDKLRG
QSHWVKRTTLVDSRTSVTDVKFAPKHMGLMLATCSADGIVRIYE 
APDVKNLSQWSLQHEISCKLSCSCISWNPSSSRAHSPMIAVGSD 
DSSPNAMAKVQIFEYNENTRKYAKAETLMTVTDPVHDIAFAPNL
GRSFHILAIATKDVRIFTLKPVRKELTSSGGPTKFEIHIVAQFD 
NHNSQWRVSWNITGTVLASSGDDGCVRLWKANYMDNWKCTGIL 
I iNGSE VKGSSQQ3TSNPSLGSNIPSLQNSLNGSSAGRKHS " 


6492 


34 


2573 


IPFLKSCCCCCLFDFPPPPLDQVQEE3CEVERVTEHGTPKPFRK 
FDSVAFGESQSEDEQFENDLETDPPNWQQLVSREVLLGLKPCEI 
; KRQEVINELFYTERAHVRTLKVLDQVFYQRVSREGILSPSELRK 
IFSULED1LQLHIGLK-EQMKAVRKRNETSVIDQIGEDLLTWFSG 
PGEEKLKHAAATFCSNQPFALEMIKSRQKKDSRFQTFVQDAESN 
PLCRRLQLKDIIPTQMORLTKYPLLLDNIATYTEWPTFREKVKK 
AADHCRQILNYVNQAVKEAENKQRLEDYQRRLDTSSLKLSEYPN 
VEELRNLDLTKRKM1HEGPLVWKVNRDKTIDLYTLLLEDILVLL
QKQDDRLVLRCHSKILASTADSKHTFSPVIKLSTVLVRQVATDN 
KALFVISMSDNGAQIYELVAQTVSEKTVWQDLICRMAASVKEQS 
TKPIPLPQSTPGEGDNDEEDPSKLKEEQHGISVTGLQSPDRDLG 
LESTLISSKPQSHSLSTSGKSEVRDLFVAERQFAKEQHTDGTLK 
EVGEDYQIAIPDSHLPVSEERWALDALRNLGLLKQLLVQQLGLT 
EKSVQEDWQHFPRYRTASQGPQTDSVIQNSENIKAYHSGEGHMP 
FRTGTGDIATCYSPRTSTESFAPRDSVGIAPQDSQASNILVMDH 
MIMTPEMPTMEPEGGIJDDSGEHFFDAREAHSDENPSEGDGAVNK 
EEKDVWLRISGNYLII.DGYDPVQESSTDEEVASSLTLQPMTGIP 
AVESTHQQQHSPQNTHSDGAISPFTPEFLVQQRWGAMEYSCFEI 
QSPSSCADSQSQIMEYIHKIEADLEHLKKVEESYTIliCQRLAGS 
ALTDKHSDKS 


6493 


557 


1147 


TPARMAYQGSSTSDCMSKTLDSASAHFAASAVVSAPVPSRKEVA 
KEONTGHNNINGWQPSGTSKTLYSTNMALSSSPGISAVQLVRT 
VGHTTTNHLIPALCTSSPQTLPMNNSCLTNAVHLNNVSWSPVN 
VHINTRTSAPSPTALKLATVAASMDRVPKVTPSSAISSIARENH 
EPERLGLNGIAETTVAMEVT 


6494 


2425 


1052 


AVAGGARPCSTPSSPHRRCRRHRPRPLPRPPAAIMSASAVYVLD 
LKGKVLICRNYRGDVDMSEVEHFMPILMEKEEEGMLSPILAHGG 
VRFMWIKHNWLYLVATSKKNACVSLVFSFLYKWQVFSEYFKEL 
EEESIRDNFVIIYELLDELMDFGYPQTTDSKILQEYITQEGHKL 
ETGAPRPPATVTNAVSWRSEGIKYRKNEVFLDVIESVNLLVSAM
GNVT.RSEIVGSIKMRVFLSGMPELRLGLNDKVLFDNTGRGKSKS 
VELEDVKFHQCVRLSRFENDRTISFIPPDGEFELMSYRLNTHVK
PLIWIESVIEKHSHSRIEYMIKAKSQFKRRSTANNVEIHIPVPN 
DADSPKFKTTVGSVKWVPENSEIVWSIKSFPGGKEYLMRAHFGL
PSVEAEDKEGKPPISVKFEIPYFTTSGIQVRYLKIIEKSGYQAL 
PWVRYITQNGDYQLRTQ 


6495 


2425 


1052 


AVAGGARPCSTPSSPHRRCRRHRPRPLPRPPAAIMSASAVYVLD 
LKGKVLICRNYRGDVDMSEVEHFMPILMEKEEEGMLSPILAHGG
VRFMWIIOINNLYLVATSKKNACVSLVFSFLYKVVQVFSEYFKEL 
EEESIRDNFVIIYELLDELMDFGYPQTTDSKILQEYITQEGHKL
ETGAPRPPATVTNAVSWRSEGIKYRKNEVFLDVIESVNLLVSAN 
GtJVlRSEIVGSIKMRVFLSGMPELRLGLNDKVLFDNTGRGKSKS 
VELEDVKFHQCVRLSRFENDRTISFIPPDGEFELMSYRLNTHVK
PLIWIESVIEKHSKSRIEYMIKAKSQFKRRSTANNVEIHIPVPN 
DADSPKFKTTVGSVKWVPENSEIWSIKSFPGGKEYLMRAHFGL 
PSVEAEDKEGKPPISVKFEIPYFTTSGIQVRYLKIIEKSGYQAL 
PWVRYITQNGDYQLRTQ


6496 


247 


559 


i..-a-.':llplqlvlpeysihslfcimflcaqewltlgliivpllfy 
hfwr 1tkcpadsselaydppwmnadtlsycqkeawcklafyll 
ffyylycmiytlvss 


6497 


1053 


352 


T ^LHPPCGAKKGNGTEEDYNFVFKWLIGESGV 
C-XTNLLSRFTRNEFSHDSRTTIGVEFSTRTVMLGTAAVKAQIWD 
TAGLERYRAITSAYYRGAVGALLVFDLTKHQTYAWERWLKELY 
DHAEATIWMLVGNKSDLSQAREVPTEEARMFAENNGLLFLETS 
AIDSTNVELAFETVLKEIFAKVSKQRQNSIRTNA1TLGSAQAGQ
EPGPGEKRACCISL 


6498


2635 


272 


SLRLCPWGTHLAGPTTMRLSSLLALLRPALPLILGLSLGCSLSL 
LF ' SNIQGKL t] )1 C K Kll L'QNPDSRARLDQEDEDFKPRI 
VPYYRDPNKPYKKVLRTRYIQTELGSRERLLVAVLTSRATLSTL 
AVAVNRTVAHHFPRLLYFTG2RGARAPAGMQWSHGDERPAW1M 
SETLRHLHTHFGADYDWFFIMQDDTYVQAPRLAALAGHLSINQD 
LYLGRAEEFIGAGECARYCHGGFGYLLSRSLLLRLRPHLDGCRG 
DILSARPDEWLGRCLIDSLGVGCVSQHQGQQYRSFELAKNRDPE 
KEGSSAFLSAFAVHPVSEGTLMYRLHKRFSALELERAYSEIEQL 
QAQIRNLTVLTPEGEAGLSWPVGLPAPFTPHSRFEVLGWDYFTE 
QHTFSCADGAPKCPLQGASRADVGDALETALEQLNRRYQPRLRF 
QKQRLLNGYRRFDPARGMEYTLDLLLECVTQRGHRRALARRVSL 
LRPLSRVEILPMPYVTEATRVQLVLPLLVAEAAAAPAFLEAFAA 
NVLEPREHALLTLLLVYGPREGGRGAPDPFLGVKAAAAELERRY 
PGTRLAWLAVRAEAPSQVRLMDWSKKHPVDTLFFLTTVWTRPG 
PFVHt 2EFNPALSPQRSPPGPPGAGF 
DPPSPPGADPSRGAPIGGRFDRQASAEGCFYNADYLAARARLAG 
ELAGQEEEEALEGLEVMDVFLRFSGLHLFRAVEPGLVQKFSLRD 
CSPRLSEELYHRCRLSNLEGLGGRAQLAMALFEQEQANST 


6499


3 


2040 


SCSADTRPSGQAWPTVGLRAAAGAFR'1'GSPLALGPETPQVACLP 
GHPPVRPQVSGGPGAMPDPAAHLPFFYGSISRAEAEEHLKLAGM 

GKAHCGPAELCEFYSRDPDGLPCNLRKPCNRPSGLEPQPGVFDC 
LRDAMVRDYVRQTWKLEGEALEQAIISQAPQVEKTjIATTAHERM 
PWYHSSLTREEAERKLYSGAQTDGKFLLRPRKEQGTYALSLIYG 
KTVYHYLISQDKAGKYCIPEGTKFDTLWQLVEYLKLKADGLIYC
LKEACPNSSASNASGAAAPTLPAHPSTLTHPQRRIDTLNSDGYT 
PEPARITSPDKPRPMPMDTSVYESPYSDPEELKDKKLFLKRDNL 
LIADIELGCGNFGSVRQGVYRMRKKQIDVAIKVLKQGTEICADTE 
EMMREAQIMHQLDNPYIVRLIGVCQAEALMLVMEMAGGGPLHKF 
LVGKREEIPVSNVAEIjLHQVSMGMKYLEEKNFVHRDLAARWIjL 
VNRHYAKISDFGLSKALGADDSYYTARSAGKWPLKWYAPECINF 
RKFSSRSDVWSYGVTMWEALSYGQKPYKKMKGPEVMAFIEQGKR 
MECPPECPPELYALMSDCWIYKWEDRPDFLTVEQRMRACYYSLA 
SKVEGPPGSTQKAEAACA 


6500 


1773 


726 


^GPTHASADAWGLVRSVTEWCANVRGNPCAAALSCPQAVLDAGK 
MLSESSSFLKGVMLGSIFCALITMLGHIRIGHGNRMHHHEHHHL
QAPNKEDILKISEDERMELSKSFRVYCIILVKPKDVSLWAAVKE 
TWTKHCDKAEFFSSENVKVFESINMDTNDMWLMMRKAYKYAFDK 
YRDQYNWFFLARPTTFAIIENLKYFLLKKDPSQPFYLGHTIKSG 
DLEYVGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGMIWKISED 
KQLAVCLKYAGVFAENAEDADGKDVFWTKSVGLSIKEAMTYHPN
QWEC-CCSDMAVTFNGLTPNQMHVMMYGVYRLRAFGPYFQ 


6501






G GGTETP VGCEAAPGGGSKKHDSLGTAGSAHL I IKDLGE 
IHSRLLDHRPVIQGETRYFVKEFEEKRGLREMRVLENLKNMIHE 
TNEHTLPKCRDTMRDSLSQVLQRIiQAANDSVCRLQQREQERKKI 
HSDHLVASEK2HMLQWDNFMKEQPNKRAEVDEEHRKAMERLKEQ 


6502 


213 


1650 


AGNKPDPWAGRNRTAVLPDVSVFHREDVGWWRSWLQQSYQAVKE
KSSEALEFMKRDLTEFTQWQHDTACTIAATASWKEKLATEGS
SGATEKMKKGLSDFLGVISDTFAPSPDKTIDCDVITLMGTPSGT
AEPYDGTKARLYSLQSDPATYCNEPDGPPELFDAWLSQFCLEEK
KGEISELLVGSPSIRALYTKMVPAAVSHSEFWHRYFYKVHQLEQ
EQARRDALKQRAEQSISEEPGWEEEEEELMGISPISPKEAKVPV
AKISTFPEGEPGPQSPCEENLVTSVEPPAEVTPSESSESISLVT 
QIAMPATAPEARVLPKDLSQKLLEASLEEQGLAVDVGETGPSPP 
IHSKPLTPAGHTGGPEPRPPARVETLREEAPTDLRVFELNSDSG 
KSTPSNNGKKGSSTDISEDWEKDFDLDMTEEEVQMALSKVDASG 
EVSGPGGSEGSEPNGPGCESSPQPAQLSPQEGPCSCLR 


6503


213 


1650 


AGNKPDPWAGRNRTAVLPDVSVFHREDVGWWRSWLQQSYQAVKE 
KSSEALEFMKRDLTEFTQWQHDTAGTIAATASWKEKLATEGS
SGATEKMKKGLSDFLGVISDTFAPSPDKTIDCDVITLMGTPSGT
AEPYDGTKARLYSLQSDPATYCNEPDGPPELFDAWLSQFCLEEK
KGEISELLVGSPSIRALYTKMVPAAVSHSEFWHRYFYXVHQLEQ 
EQARRDALKQRAEQSISEEPGWEEEEEELMGISPISPKEAKVPV 
AKISTFPEGEPGPQSPCEENLVTSVEPPAEVTPSESSESISLVT
QIANPATAPEARVLPKDLSQKLLEASLEEQGLAVDVGETGPSPP
IHSKPLTPAGHTGGPEPRPPARVETLREEAPTDLRVFELNSDSG
KSTPSNNGKKGSSTDISEDWEKDFDLDMTEEEVQMALSKVDASG
EVSGPGGSEGSEPNGPGCESSPQPAQLSPQEGPCSCLR


6504


2131 




SANMTKKKVSQKKQRGRPSSQPCRNIVGCKISHGWKEGDEPITC 
WKGTVLDQVPINPSLYLVKYDGIDCVYGLELHRDERVLSLKILS
DRVASSHISDANLANTIIGKAVEHMFEGEHGSKDEWRGHVLA:ji 
PIKKAWFYITYEKDPVTVYMYQLLDDYKEGDLRIMPESSESPPTE 
REPGGWDGLIGKHVEYTKEDGEKRIGMVIIIQVEAKPSVYFIKF 
DDDFHIYVYDLVKKS 


6505 


2131 


1294 


GaVCLVMIKVCLSILSPPPAGMKTPNAQEAEGQQTRAAAGRATG 
SANMTKKKVSQKKQRGRPSSQPCRNIVGCRISHGWKRGDEPITQ
WKGTVLDQVPINPSLYLVKYDGIDCVYGLELHRDERVLSLKILS 
DRVASSHISDANLANTIIGKAVEHMFEGEHGSKDEWRGMVLAQA 
PIMKAWFYITYEKDPVLYMYQLLDDYKEGDLRIMPESSESPPTE 
REPGGWDGLIGKHVEYTKEDGSKRIGMVIHQVEAKPSVYFIKF
DDDFHIYVYDLVKKS 


6506 




1350 


EVSPPTSCCLTVAVADPGVSEGFRGFGAGCEMPGRGRCFDCGST 
ELVEDSHYSQSQLVCSDCGCWTEGVLTTTFSDEGNLREVTYSR 
STGENEQVSRSQQRGLRRVRDLCRVLQLPPTFEDTAVAYYQQAY 
RHSGIRAARLQKKEVLVGCCVLITCRQHNWPLTMGAICTLLYAD 
LDVFSSTYMQIVKLLGLDVPSLCLAELVKTYCSSFKLFQASPSV 
PAKYVEDKEKMLSRTMQLVELANETWLVTGRHPLPVITAATFLA 
WQSLQPADRLSCSLARFCKLANVDLPYPASSRLQELLAVLLRMA 
EQLAWLRVIjRLDKRSWKHIGDLLQHRQSLVRSAFRDGTAEVET 
REKEPPGWGQG0GEG3VGNNSLGLPQGKRPASPALLLPPCMLKS 
PKRICPVPPVSTVTGDENISDSEIEQYLRTPQEVRDFQRAQAAR 
QAATSVPKPP 


6507 


1878 


929 


RSHASRLPELPSGCLVLQVQELVQMSGMEATVTIPIWQNKPHGA 
ARSWRRIGTNLPLKPCARASFETLFNISDLCLRDVPPVPTLAD 
IAWIAADEEETYARVRSDTRPLRHTWKPSPLIVKQRWASVPNLR 
GSEERLLALKKPALPALSRTTELQDELSHLRSQIAKIVAADAAS 
ASLTFDFLSPGSSNVSSPLPCFGSSFHSTTSFVISDITEETEVE 
VPELPSVPLLCSASPECCKPEHKAACSSSEEDDCVSLSKASSFA 
DMMGILKDFHRMKQSQDLNRSLLKEEDPAVLISEVLRRKFALKE 
EDISRKGN 


6508 


862 


342 


WEARKRPQRWPSERREVRVPPPHLQRGRSGLEPGTFRKMAAARP 
SLGRVLPGSSVLFLCDMQEKFRHNIAYFPQIVSVAARMLXNTTL 
DLLDRGLQVHVWDACSSRSQVDRLVALARMRQSGAFIjSTSEGL 
ILQLVGDAVHPQFKEIQKLIKEPAPDSGLLGLFQGQNSLLH 


6509 


2 


1053 


FVWNPRGGRKRRRQAAVTQAATRASGTPSPRDGTMTQGKLSVAN 
KAPGTEGQQQVHGEKKEAPAVPSAPPSYEEATSGEGMKAGAFPP 
APTAVPLHPSWAYVDPSSSSSYDNGFPTGDHELFTTFSWDDQKV 
RRVFVRKVYTILLIQLLVTLAWALFTFCDPVKDYVQANPGWYW 
ASYAVFFATYLTLACCSGPRRHFPWNLILLTVFTLSMAYLTGML 
SSYYNTTSVLLCLGITALVCLSVTVFSFQTXFDFTSCQGVLFVL 
LMTLFFSGLILAILLPFQYVPWLHAVYAALGAGVFTLFLALDTQ 
LLMGNRRHSLSPEEYIFGALNIYLDIIYIFTFFLQLFGTNRE 


6510




1156 


PCALDGCPQRGAVHPLLSSAMGLLAFLKTQFVLHLLVGFVFWS 
GLVlNFVQLCTLALt/PVSKQLYRRLKCRLAYSLWSQLVMLLEWW 
SCTECTLFTDQATVERFGKEHAVIILNHNFEIDFLCGWTMCERF 
GVLGSSKVLAKKELLYVPLIGWTWYFLEIVFCKRKWEEDRDTW 
EGLRRLSDYPEYMWFLLYCEGTRFTETKHRVSMEVAAAKGLPVL 
KYHLLPRTKGFTTAVKCLRGTVAAVYDVTLNFR3NKNPSLLGIL 
YGKKYEADMCVRRFPLEDIPLDEKEAAQWLHKLYQEKDALQEIY 
NQKGMFPGEQFKPARRPWTLLNFLSWATILLSPLFSFVLGVFAS
GSPLLILTFLGFVGAGNGHCR 


6511


2541


1425 


GEEQPLAAAPTECLEQVIGGAGDPGTWASFPSPLPGPAPLKGGK 
TMATNFSDIVKQGYVKMKSRKLGIYRRCWLVFRKSSSKGPQRLE 
KYPDEKSVCLRGCPKVTEISNVKCVTRLPKETKRQAVAIIFTDD 
SARTFTCDSELEAEEWYKTLSVECLGSRLNDISLGEPDLLAPGV 
QCEQTDRFNVFLLPCPNLDVYGECKLQITHENIYLWDIHNPRVK 
LVSWPLCSLP.RYGRDATRFTFEAGRMCDAGEGLYTFQTQEGEQI 
YQRVHSATLAIAEQHKRVLLEMEKNVRLLNKGT3I1YSYPCTPTT 
MLPRSAYWHHJTGSQNIAEASSYAGEGYGAAQASSETDLLNRFI 
LLKPKPSQGDSSEAKTPSQ 


6512 


159 




FGXKSTWFPLSRSLRVASGRSCKLGHGGYTGSG?GFGKPRDSGA 
EVPSGSGRATGCERGGVRGARQGRAPGSSIWRKEPRMVCTRKTK 
TLVSTCVILSGMTNIICLLYVGMVTNYIAEVYVRGQEPAPDKKL
EEDKGDTLKI1ERLDHLENVIKQHIQEAPAKPEEAEAEPFTDSS 
LFAHWGQELSPEGRRVALKQFQYYGYNAYLSDRLPLDRP 


6513 


2 


756 


FVSPEPGFSIAQLNLIWQLTDTKQLVHSFAEGQDQGSAYANRTA 
LFPDLLAQGNASLRLQRVRVADEGSFTCFVSIRDFGSAAVSLQV 
AAPYSKPSMTLEPNKDLRPGDTVTITCSSYQGYPEAEVFWQDGQ 
GVPLTGNVTTSQMANEQGLFDVHSILRWLGAWGTYSCLVRNPV 
LQQDAHSSVTITPQRSPTGAVEVQVPEDPWALVGTDATLRCSF
SPEPGFSLAQLNLIWQLTDTKQLVHSFAEGQDQGSAYANRTALF 
PDLLAQGNASLRLQRVRVADEGSFTCFVSIRDFGSAAVSLQVAA 
PYSKPSMTLEPNKDLRPGDTVTTTCSSYQGYPEAEVFWQDGQGV 
PLTGNVTTSQMANEQGLFDVHSILRWLGANGTYSCLVRNPVLQ
QDAHSSVTITPQRSPTGAVEVQVPEDPWALVGTDATLRCSFSP 
EPGFSLAQLNLIWQLTDTRQLVHSFTEGR 


6514 


985


302 


VGIPGPTISSAAEMEDLLDLDEELRYSLATSRAKMGRRAQQESA 
QAENHLNGKNSSLTLTGETSSAKLPRCRQGGWAGDSVKASKFRR 
KASEEIEDFRLRPQSLNGSDYGGDIPIIPDLEEVQEEDFVLQVA
APPSIQIKRVMTYRDLDNDLI4KYSAIQTLDGEIDLKLLTKVLAP
3HEVR2RNPSWQDDVGWDWDHLFTEVSS3VLTEWDPLQTEKEDP 
AGQARHT 


6515 


1345 


305 


GRVGSRRRGAAVPGGCGAGSTQLEVSASASCGALGSADMNPIW 
VH 3 :- SAGPI SKDRKERVHQGMVRAATVGYGILREGGSAVDAVEG 
AWALEDDPEFNAGCGSVLNTNGEVEMDASIMDGKDLSAGAVSA 
VQCIANPIKLARLVMEKTPHCFLTDQGAAQFAAAMGVPEIPGEK 
LVTERWKKRLEKEKHEKGAQKTDGQKNLGTV3AVALDCK3NVAY
ATSTGGIVNKMVGRVGDSPCLGAGGYADNDIGAVSTTGH3ESIL 
KVNLARLTLFHIEQGKTVEEAADLSLGYMKSRVKGLGGLIWSK 
T KWT ii HAAAKDGKLHFGIDPDDTTITDLP 


6516 


1 


1402 


F » 'iLGQDATAAARDLRTRGLQGYCPSATARQQVLVSALQQL 
KGRRSEHRNENQEMPYSTNKELILGIMVGTAGISLLLLWYHKVR 
KPGIAMKLPEFLELGNTFNSITLQDEIHDDQGTTVIFQERQLQI 
LEKLNELLTNMEELKEEIRFLKEAIPKLEEYIQDELGGKITVHK 
ISPQHRARKRRLPTIQSSATSNSSEEAESEGGYITANTDTEEQS 
FPVPKAFNTRVEELNLDVLLQKVDHLRMSESGKSESFELLRDHK 
3 XFRDE I EFMWRFARAYGDMYELSTNTQEKKHYANIGKTLS ERA 
INRAPMNGHCHLWYAVLCGYVSEFEGLQNKINYGHLFKEHLDIA 
IKLLPEEPFLYYLKGRYCYTVSKLSWIEKKMAATLFGKIPSSTV 
QEALHNFLKAEELCPGYSNPNYMYIiAKCYTDLEENQNALKFCNL 
ALLLPTVTKEDKEAQKEMQKIMTSLKR 


6517 


3 


1414 


GRVWGGSSSLNAMVYVRGHAEDYERWORQGARGWDYAHCLPYFR 
KAQGHELGASRYRGADGPLRVSRGKTNHPLHCAFLEATQQAGYP 
LTEDMNGFQQEGFGWMDMTIHKGKRWSAACAYLHPALSSTNLKA 
EAETLVSRVLFEGTRAVGVEYVKNGQSHRAYASKEVILSGGAIN 
SPQLLMLSGIGNADDLKKLGIPWCHLPGVGQNLQDHLEIYIQQ 
ACTRPITLHSAQKPLRKVCIGLEWLWKFTGEGATAHLETGGFIR 
SQPGVPHPDIQFHFLPSQVIDHGRVPTQQEAYQVHVGPMRGTSV 
GWLKLRSANPQDHPVIQPNYLSTETDIEDFRLCVKI.TREIFAQE 
ALAPFRGKELQPGSHIQSDKEIDAFVRAKADSAYHPSCTCKMGQ 
PSDPTAWDPQTRVLGV3NLRWDASIMPSMVS3NI.NAPTIMIA 
E KAAD 1 1 KGQPALWDKDVPVYKPRTLATQR 


6518


242 


1098 


PAWNPGSEPRTRVRPRARSFPLPPPRAPRRRRHRLLRAVPGPSR 
RHRCRRRAPPPPSTMGDAGSEREKAPELPPRCPCGPWGSSJCTMN 
LCSKCFADFQKKQPDDDSAPSTSNSQSDLFSEETTSDNNNTSIT 
TPTLSPSQQPLPTELNVTSPSKEECGPCTDTAHVSLITPTKRSC 
GTDSQ3ENEASPVKRPRLLENTERSEETSRSKQKSRKRCF0CQT 
KLELVQQELGSCRCGYVFCMLHRLPEQHDCTFDHMGRGREEAIM 
KKVKLDRKVGRSCQRIGEGCS


6519 


3 


1113 


E SI tL^PFSPVHCVAAAAPTATVSEKEPFGKLQLSSRDPPGSLS 
AKKVRTEEKKAPRRVNGEGGSGGNSRQLQPPAAPSPQSYGSPAS 
WSFAPLSAAPSPSSSRSSFSFSAGTAVPSSASASLSQPGPRKLL 
VPPTLLHAQPHHLLLPAAAAAASANAKSRRPKEKREKERRRHGL 
GGAREAGGASREENGEVKPLPRDKIKDKIKERDKEKEREKKXHK 
VMNEIKKENGEVKILLKSGKEKPKTNIEDLQIKKVKKKKKKKHK
ENEKRKRPKMYSKSIQTICSGLLTDVEDQAAKGILNDNIKDYVG 
KNLDTKNYDSKIPENSEFPFVSLKEPRVQNNLKRLDTLEFKQLI 
HIEHQPNGGASVIHCLQ 


6520 


3 


1113 


ERKMAEPPSPVHCVAAAAPTATVSEKEPFGKLQLSSRDPPGSLS
AKKVRTEEKKAPRRVNGEGGSGGNSRQLQPPAAPSPQSYGSPAS
WSFAPLSAAPSPSSSRSSFSFSAGTAVPSSASASLSQPGPRKLL
VPPTLLHAQPHHLLLPAAAAAASANAKSRRPKEKREKERRRHGL 
GGAREAGGASREEWGEVKPLPRDKIKDKIKERDKEKEREKKKHK 
VMNEIKKENGEVKILLKSGKEKPKTNIEDLQIKKVKKKKKKKHK 
ENEKRKRPKMYSKSIQTICSGLLTDVEDQAAKGILNDNIKDYVG 
KNLDTKNYDSKIPENSEFPFVSLKEPRVQNNLKRLDTLEFKQLI
HIEHQPNGGASVIHCLQ 


6521 


184 


1798 


KLFKMATDTSQGELVHPKALPLIVGAQLIHADKLGEKVEDSTMP 
IRRTVNSTRETPPKSKLAEGEEEKPEPDISSEESVSTVEEQENE 
TPPATSSEAEQPKGEPENEEKEENKSSEETKKDEKDQSKEKEKK 
VKKTIPSWATLSASQLARAQKQTPMASSPRPKMDAILTEAIKAC 
FQ] jr'ASWAIRKYIIHKYPSLELERRGYLLKQALKRELNRGVI 
KQVKGKGASGSFVWQKSRKTPQKSRNRKNR3SAVDPEPQVKLE 
DVLPLAFTRLCEPKEASYSLIRKYVSQYYPKLRVDIRPQLLKNA 
LQRAVERGQLEQITGKGASGTFQLKKSGEKPLLGGSLMEYAILS 
AIAAMNEPKTCSTTALKKYVLENHPGTNSNYQMHLLKKTLQKCE 
1 KG 1 IEQ1SGKGFSGTFQLCFPYYPSPGVLFPKKEPDDSRDEDE 
DEDE3SEEDSEDEEPPPKRRLQKKTPAKSPGKAASVKQRGSKPA 
PKVSAAQRGKARPLPKKAPPKAKTPAKKTRPSSTVIKKPSGGSS 
KKPATSARKE 


6522


1042 


391 


NKWLRPSPRSHRTPESGRVLSLFRLPPPGMALSGSTPAPCWEED
ECLDYYGMLSLHRMFEWGGQLTECELELLAFLLDEAPGAAGGL 
SRARSGLKLLLELERRGQCDESNLRLLGQLLRVLARHDLLPHLA 
RKRRRPVSPERYSYGTSSSSKRTEGSCRRRRQSSSSANSQQGSP
PTKRQRRSRGRPSGGARRRRRGPQPHPSSSQSPPDLPLKAK 


6523 




1097 


ASCQTRRRTAALDSGERIAGRRSPIALAMASNFNDIVKQGYVKI
RSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT
ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWC

LDIYGECTMQITHENIYLWDIHNAKVKLVMWPLSSLRRYGRDST 
WFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHER
LMLEMEQKARLQTSLTEPMTLSKSISLPRSAYWHHITRQNSVGE 
IYSLQGNHENRHSDLTGKSCKTSENRFLEENAPLVMYGITHHLF 
MDTSTCKWHDLE 


6524


2 


1097 


ASCQTRRRTAALDSGERIAGRRSPIALAMASNFNDIVKQGYVKI
RSRKLGIFRRCWLVFKKASSKGPRRLEKFPDEKAAYFRNFHKVT
ELHNIKNITRLPRETKKHAVAIIFHDETSKTFACESELEAEEWC
ICL3TRLNDI GE DLI.AA3 QRH QNE RFNV YT .M PT PN 
LDIYGECTMQITHENIYLWDIHNAKVKLVMWPLSSLRRYGRDST
WFTFESGRMCDTGEGLFTFQTREGEMIYQKVHSATLAIAEQHER 
LMLEMEQKARLQTSLTEPMTLSKSISLPRSAYWHHITRQNSVGE 
IYSLQGNHENRHSDLTGKSCKTSENRFLEENAPLVMYGITHHLF 
MDTSTCKWHDLE 








PVKTPSDAGNSPIGFCPGSDE F F SIQSSRYK 
KESKSGLVKPGSEADFSSSSSTGSISAPEVHMSTAGSKRSSSSR 
NRGPHGRSNGASSHKPGSSPSSPREKDLLSMLCRNQLSPVNIHP
SYAPSSPSSSNSGSYKGSDCSPIMRRSGRYMSCGENHGVRPPNP 
EQYLTPLQQKEVTVRHLKTKLKESERRLHERESEIVELKSQLAR 
MREDWIEEECHRVEAQLALKEARKEIKQLKQVIETMRSSLADKD 
KGIQKYFVDINIQNKKLESLLQSMEMAHSGSLRDELCLDFPCDS 
PEKSLTLNPPLDTMADGLSLEE0VTGEGADRELLVGDSIAN3TD 
LFDEIVTATTTESGDLELVHSTPGAIvlVLELLPIVMGQEEGSVW 
ERAVQTDWPYSPAISELIQSVLQKLQDPCPSSLASPDESEPDS 
MESFPESLSALWDLTPRNPNSAILLSPVETPYANVDAEVHANR 
LMRELDFAACVEERLDGVIPLARGGWRQYWSSSFLVDLLAVAA 
PWPTVLWAFSTQRGGTDPVYNIGALLRGCCWALIISLRRTAFR 
IKT 


6526 


2 


2034 


SGRAGEPEEWRGRQIIDSKETWIPFNSEDSQQLEEAYSSGKGCN 

NKYVPYSKSFSQVLEETYMLAVTLDEWKKKLESPNREIIILHNP
KLMVHYQPVAGSDDWGSTPMEQGRPRTVKRGVENISVDIHCGEP 
LQIDHLVFWHGIGPACDLRFRSIVQCVNDFRSVSLNLLQTHFK 
KAQENQQIGRVEFLPVNWHSPLHSTGVDVDLQRITLPSINRLRH 
FTNDTILDVFFYNSPTYCQTIVDTVASEMNRIYTLFLQRNPDFK 
GGVSIAGHSLGSLILFDILTNQKDSLGDIDSEKGSLNIVMDQGD 
TPTLEEDLKKLQLSEFFDIFEKEKVDKEALALCTDRDLQEIGIP 
LGPRKKILNYFSTRKNSMGIKRPAPQPASGANIPKE3EFCSSSN 
TRNGDYLDVGIGQVSVKYPRLIYKPEIFFAFGSPIGMFLTVRGL 
KRIDPNYRFPTCKGFFNIYHPFDPVAYRIEPMWPGVEFEPMLI 
PHHKGRKRMHLELREGLTRMSMDLKNNLLGSLRMAWXSFTRAPY 
PALQASETPEETEAEPESTSEKPSDVNTEETSVAVKEEVLPINV 
GMLNGGQRIDYVLQEKPIESFNEYLFALQ3HLCYWESEDTVLLV 
LKEIYQTQGIFLDQPLQ 


6527 


1 


922 


GWVPLLSRILPSDACKIYKQGINIRLDTTLIDFTDMKCQRGDLS 
FIFNGDAAPSESFWLDNEQKVYQRIHHEESEMETEEEVDILMS 
SDIYSATLSTKSISFTRAQTGWLFREDKT3RVGNFLADFYLVNG 
LVLESRKRREHLSEEDILRNKAIMESLSKGGNIMEQNFEPIRRQ 
SLTPPPQNT1TWEEYISAENGKAPHLGRELVCKESKKTFKATIA 
MSQEFPLGIELLLNVLEWAPFKHFNKLREFVQMKLPPGFPVKL 
DIPVFPTITATVTFQEFRYDEFDGSIFTIPDDYKEDPSRFPDL


6528 




1073 


LTGPAAAEPRCAADAGMKRA1GRRKGWLRLRKILFCVLGLYIA 
IPFLIKLCPGIQAKLIFLNFVRVPYFIDLKKPQDQGLNHTCNYY 
LQPEEDVTIGVWHTVPAVWWKNAQGKDQMWYEDALASSIIPIILY
LHGNAGTRGGDHRVELYKVLSSLGYHWTFDYRGWGDSVGTPSE 
RGMTYDALHVFDWIKARSGDNPVYIWGH3LGTGVATNLVRRLCE 
RETPPDALILESPFTNIREEAKSHPFSVIYRYFPGFDWFFLDPI 
TSSGIKFANDENVKHISCPLLILHAEDDPWPFQLGRKLYSIAA 
PARSFRDFKVQFVPFHSDLGYRHKYIYKSPELPRILREFLGKSE
PEHQH 


6529 


363 


2215 


THIRYNKIGWKTMSCGNEFVETLKKIGYPKADMLNGEDFDWLF 
EGVEDESFLKWFCGNVNEQIT'/LSERELEAFSILQKSGKPILEGA 
ALDEALKTCKTSDLKTPRLDDKET.EKLEDEVQTLLKLKNLKIQR 
RNKCQLMASVTSHKSLRLNAKEEEATKKjKQSQGILNAKITKIS 
NELQALTDEVTQLMMFFRHSNLGQGTNPIjVFLSQFSLEKYLSQE 
EQSTAALTLYTKKQFFQGIHEWESSNESQFFNFLKIQTPSICD 
NQEILEERRLEMARLQLAYICAQHQLIHLKASNSSMKSSIKWAE 
ESLHSLTSKAVDKENLDAKISSLTSEIMKLEKEVTQIKDRSLPA 
WRENAQLLNMPWKGDFDLQIAKQDYYTARQELVLNQLIKQKA 
SFELLQLSYEIELRKHRDIYRQLENT.V EI SNMMr.YKQLEML 
TDPSVSQQINPRNTIDTKDYSTHRLYQVLEGENKKKELFLTHGN 
LEEVAEKLKQNISLVQDQLAVSAQEHSFFLSKRNKDVDMLCDTL 
YQGGNQLLLSDQELTEQFHKVESOLNKLNHLLTDILADVKTKRK 
TLANNKLHQMEREFYVYFLKDEDYLKDIVENLETQSKIKAVSLE 
D 


6530 


128 


2986


GAAHHGAIVQVHPLLPGSSTIMIHDLCLVFPAPAKAWYVSDIQ 
ELYIRWDKVEIGKTVKAYVRVLDLHKKPFLAKYFPFMDLKLRA 
ASPIITLVALDEALDNYTITFLIRGVAIGQTSLTASVTNKAGQR 
INSAPQQIEVFPPFRLMPRKVTLLIGATMQVTSEGGPQPQSIJIL 
FSISNESVALVSAAGLVCGLAIGNGTVSGLVQAVDAETGKWII 
SQDLVQVEVLLLRAV3IRAPIMRMRTGTQMPIYVTGITNHQNPF
SFGNAVPGLTFHWSVTKRDVLDLRGRHHEASIRLPSQYNFAMW 
LGRVKGRTGLRAWKAVDPTSGQLYGLARELSDEIQVQVFEKLQ 
LLNPEIEAEQILMSPNSYIKLQTNRDGAASLSYRVLDGPEKVPV 
VHVDEKGFLASGSMIGTSTIEVIAQEPFGANQTIIVAVKVSPVS 
YLRVSMSPVLHTQNKEALVAVPLGMTVTFTVHFHDNSGDVFHAH 
SSVLNFATNRDDFVQIGKGPTNNTCWRTVSVGLTLLRVWDAKH 
PGLSDFMPLPVLQAISPELSGAMWGDVLCLATVLTSLEGLSGT
WSSSANSILH1DPKTGVAVARAVGSVTVYYEVAGHLRTYKE\ I W 
SVPQRIMARHLHPIQTSFQEATASIWIVAVGDRSSNLRGECTPT 
QREVIQALHPETLISCQSQFKPAVFDFPSQDVFTVEPQFDTALG 
QYFCSITMHRLTDKQRKHLSMKKTALWSASLSSSHFSTEQVGA 
EVPFSPGLFADQAEILLSNHYTSSEIRVFGAPEVLENLEVKSGS 
PAVLAFAKEKSFGWPSFITYTVGVLDPAAGSQGPLSTTuTFSSP 
VTNQAIAIPVTVAFWDRRGPGPYGASLFQHFLDSYQVMFFTLF 
ALLAGTAVMIIAYHTVCTPRDIiAVPAALTPRASPGHSPHYFAAS
S?TSFNALPPARKASPPSGLWSPAYASH


6531 


845 


1425 


PSASIPPSASPDPVPDIRTCHFCLVEDPSVGCISGSEKCTISSS 
SLCMVITIYYDVKVRFJVRGCGQYISYRCQEKRNTYFAEYWYQA 
QCCQYDYCNSWSSPQLQSSLPEPHDRPLALPLSDSQIQWFYQAL 
NLSLPLPNFHAGTEPDGLDPMVTLSLNLGLSFAELRRMYLFLNS 
SGLLVLPQAGLLTPHPS 


6532 


2 


954 


AAGPPSE\^/NQDSLFPEPEPGPAPQVLLGPQGPGLIKGVAPPTL
ITDSTGTHLVLTVTNKNAHSPGLSRGSPQQPSSQPGSEAPAPSA 
QMDLEHPLQPLFGTPTSLLKKEPPGYEEAMSQQPKQQENGSSSQ 
QMDDLFDILIQSGEISADFKEPPSLPGKEKPSPKTVCWSPLAAQ 

EPLSLIDDLHSQMLSSTAILDHPPSPMDTSELHFVPEPSSTMGL 
DLADGHLDSMDWLELSSGGPVLSLAPLSTTAPSLFSTDFLDGHD 
LQLHWDSCL 


6533


1798 


373 


STISWLARVEPPRRSSGVGAARLRFPGGSRPLPJVRACVLALAVL 

GAKLCSKLKAELKFLQKVEAGKVAIKESHLQSTNLTHLRAIVES 
AENLEEWSVLHVFGYTDTLGEKQTLWDWANGGHTWVKAIGR 
KAEALHNIWLGRGQYGDKSIIEQAEDF1QASHQQPVQYSNPHII 
FAFYNSVSSPMAEKLKEMGISVRGDIVAVNALLDHPEELQPSES 
ESDDEGPELLQVTRVDRENILASVAFPTEIKVDVCKRVNLDITT 

KELFACESAVXDFQSILDTLGGPGEP.ERATVLIKRINWPCQPS 
ERALRLVASSKINSRSLTIFGTGDTLKAITMTANSG?VRAANNQ 
GVKFSVFIHQPRALTESKEALATPLPKDYTTDSEH 


6534


47 


596 


XATRFISAAFWLNKQGVSPAKLPHTSWSWSLQTLSFLFSGDLA 
EKSLQCFPCSAMLLELIPLLGIHFVLRTARAQEVTQPDIHITVS 
EGASLELRCNYSYGATPYLFWMERTVEEAFILLVCLKPWRVASS 
LEKKEKEDESFQLLLGSRYNVLKAHCLLPLIRKLTSGDSLLSAQ 
PIICPQGL 


6535 


250 


964 


LIKTFFRDVAIQRDLLPKEKNLETLLT1AFLEIDKAFSSHARLS 
ADATLLTSGTTATVALLRDGIELWASVGDSRAILCRKGKPMKL 
TIDHTPERKDEKERIKKCGGFVAWNSLGQPHVNGRLAMTRSIGD 
LDLKTSGVIAEPETKRIKLHHADDSFLVLTTDGINFMV^SQEIW 
DFVNQCHDPNEAAHAVTEQAIQYGTEDNSTAVWPFGAWGKYKN 
SEINFSFSRSFASSGRWA 


6536


242 


1174 


SLVKEMTNQYGILFKQEQAHDDAIWSVAWGTNKKENSETWTGS 
LDDLVKVWKWRDERLDLQWSLEGHQLGWSVDISHTLP I AAS S S 
LDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSPDSQYIATGTHV 
GKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGI 
INIFD IATGKLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIK 
IYDVQHANLAGTLSGHASWVLNVAFCPDDTHFVSSSSDKSVKVW 
DVGTRTCV1ITFFDHQDQVWGVKYNGNGSKIVSVGDDQEIHIYDC 


6537 


1638 


921 


NRFNPPPTQGPDPSLVYRPWDPEVAKDKASFRNYTSGPLLDRV

[ LMH THQTVDFVRSKHAQFGGFSYKKMTVMEAVDLLDGLV 
DESDPDVDFPNSFHAFQTAEGIRKAHPDKDWFHLVGLLHDLGKV 
LALFGEPQWAWGDTFPVGCRPQASWFCDSTFQDNPDLQDPRY 
STELGMYQPHCGLDRVLMSWGHDGEARGGQWGGGGRWGTVGGGG 
AEAVPAGDTLSPQSTCTR 


6538 


3345 


2412 


PYLYDFLDALITCQTAPEEAFIKLDGLAGMLTEQLRRLTKQVQE 
ARHNRDDEAIKKAVNEYDETMEKYIPVLMAQAKIYWNLENYPMV 
EKIFRKSVEFCNDHDVWKLKVAHVLFMQENKYKEAIGFYEPIVK 
KHYDNILNVSArVLAWLCVSYIMTSQNEKAEELMRKIEKEEEQL 
SYDDPNRKMYHLCIVKLVIGTLYCAKGNYEFGI3RVIKSLEPYN 
KKLGTDTWYYAKRCFLSLLENMSKHMIVIHDSVIQECVQFLGHC 
ELYGTNIPAVIEQPLEEERMHVGKNTVTDESRQLKALIYEIIGW 
NK 


6539




339 


FLGAASPHPHFSSLAPHPDQPEFTPVQDELEAMSLWGPGV 


6540


3 




LERLWLLLLRRPEDAMAECPTLGEAVTDHPDRLWAWEKFVYLDE 
KQHAWLPLTIEIKDRLQLRVLLRREDWLGRPMTPTQIGPSLLP 
IMWQLYPDGRYRSSDSSFWRLVYHIKIDGVEDMLLELLPDD 


6541


11S5 


536


RTLVQRRILMLLRKPARGRDLRGRGRGTPRGGRKGLLPTPDEFP 
RFEGGRKPDSWDGNREPGPGHEHFRDTPRPDHPPHDGHSPASRE 
RSSSLQGMDMASLPPRKRPWHDGPGTSEHREMEAPGGPSEDRGG 
KGRGGPGPAQRVPKSGRSSSLDGEHHDGYHRDEPFGGPPGSGTP 
SRGGRSGSNWGRGSNMNSGPPRRGASRGGGRGR 


6542


3 


3775 


SWPRGRGETGGHPGALRTRTMQKSTOYNEGHALYLAFLARKEGT 
KRGFLSKKTAEASRWHEKWFALYQNVLFYFEGEQSCRPAGMYLL 
EGCSCERTPAPPRAGAGQGGVRDALDKQYYFTVLFGHEGQKPLE 
LRCEEEQDGKEWMEAIHQASYADILIEREVLMQKYIHLVQIVET 
EKIAANQLRHQLEDQDTEIERLKSEIIALNKTKERMRPYQSNQE 
DEDPDIKKIKKVQSFMRGWLCRRKWKTIVQDYICSPHAESMRKR 
NQIVFTMVEAESEYVHQLYILVNGFLRPLRMAASSKKPPISHDD 
VSSIFLNSETIMFLHEIFHQGLKARIANWPTLILADLFDILLPM 
LNIYQEFVRNHQYSLQVLANCKQNRDFDKLLKQYEANPACEGRM 
LETFLTYPMFQIPRYI ITLHELLAHTPHEHVERKSLEFAKSKLE 
ELSRVMHDEVSDTENIRKNLA1ERMIVEGCDI1LDTSQTFIRQG 
SLIQVPSVEEGKLSKVRLGSLSLKKEGERQCFLFTKHFLICTRS 
SGGKLHLLKTGGVLSLIDCTLIEEPDASDDDSKGSGQVFGHLDF 
KIWEPPDRAAFTWLLAPSRQEKAW1SDISQCVDNIRCN3LM 
TIVFSENSKVTVPHMIKSDARLHKDDTDICFSKTLNSCKVPQIR 
YASVERLLERLTDLRFLSIDFLNTFLHTYRIFTTAAVVLGKLSD 
I YKR P FTE I PVRSLELFFATSQNNRGEmVDG KS PRLCRKFSSP 
PPLAVSRTSSPVRARKLSLTSPLNSKI3ALDLTTSSSPTTTTQS 
PAAS P PPHTGQ I PLDLSRGLSS PEQS PGTVEEfl V DNPRVDLCNK 
LKRSIQKAVLESAPADRAGVESSPAADTTELSPCRSPSTPRHLR 
YRQPGGQTADNAHCSVSPASAFAIATAAAGHGSPPGFNNTERTC 

EVLRDPDLLPQERKAAANILMALSQDDQDDIHLKLEDIIQMTDC 
MKAECFESLSAMELAEQITLLDHVIFRSIPYEEFLGQGWMKjDK 
NERTPYIMKTSQHFNDMSNLVASQIMNYADVSSRANAIEKWVAV 
ADI CRCLHNYNGVLE ITSALNRSAIYRLKKTWAKVSKQTKALMD 
KLQKTVSSEGRFKNLRETLKNCNPPAVPYLGMYLTDLAFIEEGT 
PNFTEEGLVNFSKMRMISHIIREIRQFQQTSYRIDHQPKVAQYL 
LDKDLI IDEDTLYELSLKIEPRLPA 


6543 


1S57 




LKVHLQTQQEVKLRMTGMALRWRTDGILALYSGLSASLCRQMT 
YSLTRFAIYETVRDRVAKGSQGPLPFHEKVLLGSVSGLAGGFVG 
TPADLVNVRMQNDVKLPQGQRRNYAHALDGLYRVAREEGLRRLF 
SGATMAS S RGALVT VGQLS C YDQAKQLYLSTG YLS DNIFTHFVA 
SFIAGGCATFLCQPliDVLKTRLMNSKGEYQGVFHCAVETAKLGP 
LAFYKGLVPAGIRLIPHTVLTFVFLEQLRKNFGIKVPS 


6544 


630 


79 


P S F CF I RS RLDGQ PWMAGLEAWLSQNFSLHQPQSR VR VRRAS I S 
EPSDTDPEPRTLNPSPAGWFVQQHPELELMSSFRERFGRNWLQY 
RSHLEPSGNPLPATPTTSAPSAPPASSQGPDTAPRP3PPQEEAR 
GPQES PQKMSEEVRAEPQEEEEEKEGKEEKEEGEMAPLPEAHLG 
EGKQKECP 


6545 


176 




P - -^-KAALLPAAMTPLLTLILWLMGLPLAQALDCHVCAYNGDN 
C?I'l i't'-P.CPAMVAYCMTTRTYYTPTRMK\ r SKSCVPRCFETVYDGY 
S f HAS TTS CCQ YDLCNGTGLATPATLALAP I LLATLWGLL 


6546 


1S57 


354 


HLLNGLDEVAAFFVADLGAIVRKHFCFLKCLPRVRPFYAVKCNS 
SPGVLKVLAQLGLGFSCANKAEMELVQHIGIPASKIICANPCKQ 
IAQIKYAAKHGIQLLSFDNEMELAKWKSHPSAKKVLCIATDDS 
HS LS CLS LKFGVS LKS CRHLLENAKKHHVE WGVS FHIGSGCPD 
PQAYAQSIADARLVFEMGTELGHKMHVLDLGGGFPGTEGAKVRF 
EEIASVINSALDLYFPEGCGVDIFAELGRYYVTSAFTVAVSIIA 
KKE VLLDQPGREEENGS TSKT 1 VYHLDEGVYG I FNS VL FD^I CP
TPILQKKPSTEQPLYSSSLWGPAVDGCDCVAEGLWLPQLHVGDW 
LVFDNMGAYTVGMGSPFWGTQACHITYAMSRVAWEALRRQLMAA 
EQEDDVEGVCKPLSCGWEITDTLCVGPVFTPASIM 


6547 


1 


541 


LHSKYLAPALCSQPGMMRCCRRRCCCRQPPHALRPLLLLPLVLL 
PP1AAAAAGPNRCDTIYQGFAECLIRLGDSMGRGGELETTCRSW 
NDFHACASQVLSGCPEEAAAVWESLQQEARQAPRPNNLHTLCGA 

RPLA 


6548 




219 


FVSRLSVRDVRFPTFLGGHGADAMHTDPDYSAAYVPIETDAEDG 
IKGCGITFTLGKGTEVGELKILSRFQNA 


6549


73 


1490 


ETGRVCEDARPACGSRSRRRRKEAAPGIPTPSPSSSSPTSSRPA 
ARAFSKAPARLSRPRAREEPPDPGRRYIQEEIIQARKHKLIKMC 
S SVAAKLWFLTDRR I REDYPQKE I LRALKAKCCEEELDFRAWM 
DEWLTIEQGNLGLRINGELITAYPQWWRVPTPWVQSDSDIT 
VLRHLEKMGCRLMNRPQAILNCVNKFWTFQELAGHGVPLPDTFS 
YGGH3NFAKMIDEAEVLEFPMWKNTRGHRGKAVFLARDKHHLA 
DLSHLIRHEAPYLFQKYVKESHGRDVRVIWGGRWGTMLRCST 

MKDDGSFCVCEANANVGFIAFDKACNLDVAGIIADYAASLLPSG 
RLTRRMSLLSVVSTASETSEPELGPPASTAVDNMSASSSSVDSD 
PESTERELLTKLiPGGLFNMNQLLANEIKLLVD 


6550 


2293 


922 


FKVSRDGAPDCG1LUI i - T ^< * ^RA 5SS 3CWYYLRYFF 

LLGLTASQSNLTKELNFTTRAKDAIMQMWLNARRDLDRINASFR 
C'CQGDRVIYTNWQRYMAAIILSEKQCRDQFKDMNKSCDALLFML 
NQKVKTLEVE IAKEKTI CTKDKESVLLNKRVAEEQLVECVKTRE 
LQHQERQLAKEQLQKVQALCLPLDKDKFEMDLRNLWRDS IIPRS 

IERVARENSDLQRQKLEAQQGLRASQEAKQKVEKEAQAREAKLQ 
AECSRQTQLALEEKAVLRKERDNLAKELEEKKREAEQLRMELAI 
RNSALDTCIKTKSQPMMPVSRPMGPVPNPQPIDPASLEEFKRKI 
LESQRPPAGI PVAPSSG 


6551


157 


748 


IQPPDPRNMTLAAYKEKMKELPLVSLFCSCFLADPLNKSSYKYE 
ADTVDLNWCVISDMEVIELNKCTSGQSFEVILKPPSFDGVPEFN 
ASLPRRRDPSLEEIQKKLEAAEERRKYQEAELLKHLAEKREHER 
EVIQKAIEENNNFIKMAKEKLAQKMESNKENREAHLAAMLERLQ 
EKDKHAEEVRKNKELKEEASR 


6552 


157 


748


IQFPDPRNMTLAAYKEKMKELPLVSLFCSCFLADPLNKSEYKYE 
ADTVDLNWCVISDMEVIELNKCTSGQSFEVILKPPSFDGVPEFN 
ASLPRRRDPSLEEIQKKLEAAEERRKYQEAELLKHLAEKREHER 
EVIQKAIEENNNFIKMAKEKLAQKMESNKENREAHLAAMLERLQ 
EKDKHAEEVRKNKELKEEASR 


6553 


2 


1807 


F VWS KMAAHLS YGR VNLNVLREAVRRELREFLDKCAGS KAI VWD 
EYLTGPFGLIAQYSLLKEHEVEKMFTLKGNRLPAADVKNIIFFV 
RPRLBLMDIIAENVLSEDRRGPTRDFHILFVPRRSLLCEQRLKD 
LGVLGSFIHREEYSLDLIPFDGDLLSMESEGAFKECYLEGDQTS 
LYHAAKGLMTLQALYGTIPQIFGKGECARQVANMMIRMKREFTG 
SQNSIFPVFDNLLLLDRNVDLLTPLATQLTYEGLIDEIYGIQWS 
YVKLPPEKFAPKKQGDGGKDLPTEAKKLQLNSAEELYAEIRDKN 
3V1SKKAKIISAAFEERHNAKTVGEIKQFVSQLPHMQAA 
RGSLANHTSIAELIKDVTTSEDFFDKLTVEQEFMSG1DTDKVNN 
YIEDCIAQKHSLIKVLRLVCLQSVCNSGLKQKVLDYYKREILQT 
YGYEHI LTLHNLEKAGLLKPQTGGRNNYPT IRKTLRLWMDDVNE 
QNPTEISYVYSGYAPLSVRLAQLLSRPGWRSIEEVLRILPGPHF 
EERQPLPTGLQKKRQPGENRVTLIFFLGGVTFAEIAALRFLSQL 
EDG3TEYVIATTKLMNGTSWIEALMEKPF 


6554 


119 


1244 


FBMGSQVSVESGALHWIVGGGFGGIAARSQLQALNVPFMLVDM 
KD S FHHNVAALRAS VE TG FAKKT FIS YS VTFKDNFRQGL WG I D 

AYEDMVRQVQRSRFIVWGGGSAGVSMAAEIKTEYPEKEVTLIH 
SQVALADKELLPSVRQEVXEILLRKGVQLLLSERVSKLEELPLN 
EYREYIKVQTDKGTEVATKLVILCTGIKINSSAYRKAFESRLAS 
SGALRVNEHLQVEGHSNVYAIGDCADVRTPKMAYLAGLHANIAV 
ANIVNSVKQRPLQAYKPGALTFLLSMGRNDGVGQISGFYVGRLM 
VRLTKSRDLFVSTSWKTMRQSPP 


6555


1552 


498 


IHMALLRKINQVLLFLLIVTLCVILYKKVHKGTVPKNDADDESE 
TPEELEEE I P WI CAAAGRMGATMAAINS I YSNTDANI LFYWG 
LRNTLTRIRKWIEHSKLREINFKIVEFNPMGLKGKIRPDSSRPE 

LGHAAAFSDDCDLPSAQDINRLVGLQNTYMGYLDYRKKAIKDLG 
ISPSTCSFNPGVIVANMTEWKHQRITKQLEKWMQKNVEENLYSS 
SLGGGVATSPMLIVFHGKYSTINPLWHIR1ILGWNPDAKYSEHFL 
QEAKLLHWKGRHKPWDFPSVHNDLWESWFVPDPAGIFKLNHHS 


6556


241 


1449 


ASLCKGCFFVTHVLVIILPSLQSPPTFGFLLDIDGVLVRCHRVI 
PAALKAFRRLVNSQGQLRVPWFVTNAGNILQHSKAQELSALLG 
CEVDADQVILSHSPMKLFSEYHEKRMLVSGQGPVMENAQGLGFR 
NVVTVDELRMAFPLLDMVDLERRLKTTPLPRNDFPRIEGVLLLG 

KMAEAKMPRFGHGTFLLCLETIYQKVTGKELRYEGLMGKPSILT 
YQYAEDLIRRQAERRGWAAPIRKLYAVGDNPMSDVYGANLFHQY 
LQKATHDGAPELGAGGTRQQQPSASQS C I S ILVCTGVYNPRNPQ 
STEPVLGGGEPPFHGHRDLCFSPGLMEASHWNDWEAVQLVFR 
KEGWALE 


6557 


2598 


1534 


RMCGRTSCHLPRDVLTRACAYQDRRGQQRLPEWRDPDKYCPSYN 
KSPQSNSPVLLSRLHFEKDADSSERI IAPMRWGLVPSWFKESDP 
SKLQFNTTNCRSDTVMEKRSFKVPLGKGRRCVVLADGFYEWQRC 

MAGIFDCWEPPEGGDVLYSYTIITVDSCKGLSDIHHRMPAILDG 
EEAVSKWLDFGEVSTQEALKLIHPTENITFHAVSSWNNSRNNT 
PECLAPVDLWKKELRASGSSQRMLQWLATKSPKKEDSKTPQKE 
ESDVPQWSSQFLQKSPLPTKRGTACLLEQWLKREKEEEPVAKRP 
Y5Q 


6558 


21 


1138 


FHGRRHGGRKMELGSCLEGGREAAEEEGEPEVKKRRLLCVEFAS 
VASCDAAVAQCFLAENDWEMERALNSYFEPPVEESALERRPETI 
SEPKTYVDLTNEETTDSTTSKISPSEDTQQENGSMFSLITWNID 
GLDLNNLS ERARGVCSYLALYS PDVI FLQEVI PP YYS YLKKRSS 
NYEIITGHEEGYFTAIMLKKSRVKLKSQEIIPFPSTKMMRNLLC 
VHVNVSGNELCLMTSHLESTRGHAAERMNQLKMVLKKMQEAPES 
ATVIFAGDTNLRDREVTRCGGLPHNIVDVWEFLGKPKHCQYTWD 
TQMNSNLGITAACKLRFDRIFFRAAAEEGHIIPRSLDLLGLEKL 
DCGRFPSDHWGLLCNLDI IL 


6559




364 


5 r W,T,PTRPKKLKAWQTPIAMDCCASRSCSVPTGPATTICSS 
DKSCRCGVCLPSTCPHTVWLLEPTCCIINCPPPCHIPQPCVPTCF 
LLNSCQPTPGLETLNLTTFTQPCCEPCLPRGC 


6560 


3 


1435 


"3 1 F CHW PRPLPQS CVGTEGGLQVRDTS SRI AKG 
3 /jHTIvMSLHGASGGHERSRDRRRSSDRSRDSSHERTESQLTPC 
IRNVTSPTRQHHVEREKDHSSSRPSSPRPQKASPNGSISSAQNS 
S RN3 3QS3 SDGSCKTAGEMVFVYENAKEGARNIRTSERVTLIVD 
NT] r 'VDPSIFTAQPNTMLGRMFGSGREHNFTRPNEKGEYEVAE 
GIGSTVFRAILDYYKTGIIRCPDGISIPELREACDYLCISFEYS 
TIKCRDLSALMHELSNDGARRQFEFYLEEMILPLMVASAQSGER 
ECHIWLTDDDVVDWDEEYPPQMGEEYSQIIYSTKLYRFFKYIE 
NRDVAKSVLKERGLKKIRLGIEGYPTYKEKVKKRPGGRPEVIYN 
YVQRPFIRM3WEKEEGKSRHVDFQCVKSKSITKLAAAAADIPQD 
QLWMHPTPQVDELDILPIHPPSGNSDLDPDAQNPML 


6561


3 




PGRRFRRKESSSSRWFPADCLLGLRGPASSLLSPEPSPSWPSHS 
P CPMAALTDLS FMYRWFKNCNLVGNLSEKYVF ITGCDSGFGNLL 
AKQLVDRGMQ VLAACFTEEGS Q KLQRDTS YRLQTTLLDVTKS E S 
IKAAAQWVRDKVGEQGLWALVNNAGVGLPSGPNEWLTKDDFVKV 
INVNLVGLI E VTLHMLPMVKRARGRWNMS S SGGRVAVIGGGYC 
VSKFGVEAFSDSIRRELYYFGVKVCIIEPGNYRTAILGKENLES 
RMRKLWERLPQETRDSYGEDYFRIYTDKLKNIMQVAEPRVRDVI 
NSMEHA1VSRSPRIRYNPGLDAKLLYIPLAKLPTPVTDFILSRY 
LPRPADSV 


6562


1 


1562 


MSTLYDIRAHKAQLLRFFASSDSNKALEQRRTmTPKLEHLDRV 
LYEWFLGKRSEGVPVSGPMLIEKAKDFYEQMQLTEPCVFSGGWL 
WRFKARHGIKKLDASSEKQSADHQAAEQFCAFFRSLAA3HGLSA 
EQVYNADETGLFWRCLPNPTPEGGAVPGPKQGKDRLTVLMCANA 
TGSHRLKPLAIGKCSGPRAFKGIOHLPVAYKAQGNAWVDKFIFS 
DWFHHIFVPSVREHFRTIGLPEDSKAVLLLDSSRAHPQ3AELVS 
SNVFTI FLPASVAS LVQPMEQGI RRDFMRNF INPPVPLQGPHAR 
YMMMDAIFSVACAWKAVPSHVFRRAWRKLWPSVAFAEGSSEEEE 
LEAECFPVKPHNKSFAHILELVKEG3SCPGQLRQRQAASWGVAG 
REAEGC-RPPAATSPAEWWSSEKTPKADQDGRGDPGEGEEVAWE 
QAAVAFDAVLRFAF.RQPCFSAQEVGQLRALRAVFRSQQQVRRRR 
GALGAWKVEALQEGPGGCGATAQSPLPCSSTAGDN 


6563


1319 


2694 


LARPAQPVLLREPEGAGPPVPAGHLVHHLQGGHLKEHAHPDLEA 
HEHPLPCDQMFWRQMGGHLRMVEflNSRGWWGIGYDHTAWVYTG 
GYGGGCFQGLASSTSNIYTQSDVKCVHIYENQRWNPVTGYTSRG 
LPTDRYMWSDASGLQECTKA3TKPPSLQWAWVSDWFVDFSVPGG 
TDQEGWQYASDFPASYHGSKTMKDFVRRRCWARKCKLVTSGPWL 
EVP P IALRDVS 1 1 PESPGAEGSGHS IALWAVSDKGDVLCRLGVS 
ELNPAGSSWLHVGTDQPFASISIGACYQVWAVARDGSAFYRGSV 
YPSQPAGDCWYHIPSPPRQRLKQVSAGQTSVYALDENGWLWYRQ 
GITPSYPQGSSBEHVSNNVCRVSVGPLDQVWVIANKVQGSHStS 
RGTVCHRTGVQPHEPKGHGWDYGIGGGWDHISVRANATRAPRSS 
SQEQEPSAPPEAHGPVCC 


6564 


1 


975 


APGSCALWSYCGRGWSRAMRGCQLLGLRSSWPGDLLSARLLSQE 
KRAAETHFGFETVSEEEKGGKVYQVFESVAKKYDVMNDMMSLGI 
HRVWKDLLLKIuTIHPLPGTQLLDVAGGTGDlAFRFLNYVQSQHQR 
KQKRQLRAQQNLSWEEIAKEYQNEEDSLGGSRVWCDINKEMLK 
VC-KQKALAQGYRAGLAWVLGDAEELPFDDDKFDIYTIAFGIRNV 
THIDQALQEAHRVLKPGGRFLCLEFSQVNNPLISRLYDLYSFQV 
IPVLGEVIAGDWKSYQYLVESIRRFPSQEEFKDMIEDAGFHKVT 
YESLTSGIVAIHSGFKL 


6565


1464 


999 


RSAVANGLTKRRMGLKLKGRYISLILAVQIAYLVQAVRAAGKCD 
AVFKGFSDCLLKLGD3MANYPQGLDDKTNIKTVCTYWEDFHSCT 
VTALTDCQEGAKDMWDKLRKESKNLWIQGSLFELCGSGNGAAGS 
LLPAFPVLLVSLSAALATWLSF 


6566 


3 


1385 


1 "1 --OPGGTQPEPGLGARMAIHKALVMCLGLPLFLFPGAWAQG 
HVPPGCSQGLNPLYYNLCDRSGAWGIVLEAVAGAGIVTTFVLTT 
ILVASLPFVQDTKKRSLLGTQVFFLLGTLGLFCLVFACVEKPDF 
STCASRRFLFGVLFAICFSCIiAAHVFALNFLARKNHGPRGWVIF 
TVALLLTLVEVI INTEWLI ITLVRGSGEGGPQGNSSAGWAVASP 
CAIANMDFVMAL I YTOLLL1GAFLGAWPALCGRYKRWRKHGVFV 
LLTTATSVAIWVVWIi/MYTYGNKQHNSPTWDDPTLAIALAAWAW 
AFVLFYVIPEVSQVTKSSPEQSYQGDMYPTRGVGYETILKEQKG 
Q3MFVENKAFSMDEPVAAKRPVSPYSGYNGQLLTSVYQPTEMAL 
MHKVPSEGAYDIILFRATAKTSQVMGSANSTLRAEDMYSAQSHQA 
AT P P KDGKNS Q VFRNP YVWD 


6567


125 


863 


TKRSNLKAYACS IHHIRTMSYVFVJMDSSQTNVPLLQACIDGDFN 
Y S KR T ,T ,ES GFDPNIRDSRGRTGLHLAAARGNVD I CQLLHKFGAD 
LLATDYQGNTALHLCGHVDTIQFLVSNGLKIDICNHQGATPLVL 
AKRRGVNKDVIRLLESLEEQEVKGFNRGTHSKLETMQTAESESA 
MESHSLLNPNLQQGEGVLSSFRTTWQEFVEDLGFWRVLLLIFVI 
ALLSLGIAYYVSGVLPFVENQPELVH 




3 


1183 


HASDRLLVLPDNYSHFSQASANLQGPSRTTELFHPTLASISSPM 
LEGAELYFN\'DHGYLEGLVRGCKASLLTQQDYINLVQCETLEDL 
KIHLQTTDYGNFLANHTMPLTVSKIDTEMRKRLCGEFEYFRNHS 
LEPLSTFLTYMTCSYM1DHVILLMNGALQKKSVKEILGKCHPLG 
RFTEMEAVNIAETPSDLFNAILIETPLAPFFQDCMSENALDELN 
IELLRNKLYKSYLEAFYKFCKNHGDVTAEVMCPILEFEADRRAF 
IITLNSFGTELSKEDRETLYPTFGKLYPEGLRLLAQAEDFDQMK 
NVADHYGVYKPLFEAVGGSGGKTLEDVFYEREVQMNVLAFNRQF 
HYGVFYAYVKLKEQE1RNI VWIAECISQRHRTK TNS YI P IL 




205 


1532 


RRRGPQRLGEGRPTPLLCRWRTAGPSHWEKQARAFQ3LRPVDPR

RMSWLFPLTKSASSSAAGSPGGLTSLQQQKCRLIESLRNSHSSI 

AE1QKDVEYRLPFTINNLTININILLPPQFFQEKPVISVYPPIR 

HHLMDKQGVYVTSPLVNNFTMHSDLGKIIQSLLDEFWKNPPVLA 

PTSTAFPYLYSNPSGMSPYASQGFPFLPPYPPQEANRSITSLSV 

ADTVSSSTTSHTTAKPAAPSFGVLSNLPLPIPTVDASIPTSQNG 

FGYKMPDVPDAFPELSELSVSQLTDMNEQEEVLLEQFLTLPQLK 

QIITDKDDLVKSIEELARKMLLLEPyLEAKRQTVLDKYELLTQM 

KSTFEKKMQRQHELSESCSASALQARLKVAAHEAEEESDNIAED 

FLEGKMEIDDFLSSFMEKRTICHCRRAKEEKLQQAIAMHSQFHA 

PI, 


6570 


330 


1304 


ARLPRLTFLREGFLYVIjLSHWVFVGAPRPPASDSWKXGLVPSAP 
PASRKMGSKALPAPIPLHPSLQLTNYSFLQAVNTFPATVDHLQG 
LYGLSAVQTMHMNHWTLGYPIIVHEITRSTITEMAAAQGLVDARF 
PFPALPFTTHLFHPKQGMAHVLPALHKDRPRFDFANLAVAATQ 
EDPPKMGDLSKLSPGLGSPISGLSKLTPDRKPSRGRLPSKTKKE 
FICKFCGRHFTKSYNLLIHERTHTDERPYTCDICHKAFRRQDHL 
RDHRYIHSKEKPFKCQECGKGFCQSRTLAVHKTLHMQTSSPTAA 
SSAAKCSGETVI CGGT 


6571 


169 




APJMKRKKLQKLTDTLTKKCKHLFRC-FDKDNDGCVFi/LEWIHGL 
SLFLRGSLEEKMKYCFEVFDLNGDGFISKEEMFHMLKNSLLKQP 
SEEDPDEGIKDLVEITLKKMDHDHDGKLSFADYELAVREETLLL 
EAFGPCLPDPKSQME FEAQVFKDPNE FMDM 


6572 


49 


1646 


TPERAQPGALLGAAGCCVCGGRWWPRSHERGYFSSAKMGSKRRN 
LSCSERHQKLVDENYCKKLHVQAIjKNVNSQIRNQMVQNENDNRV 
QRKQFLRLLQNEQFELDMEEAIQKAEEMKRLKELQLKQEEKLAM 
ELAKLKHESLKDEKMRQQVRENSIELRELEKKLKAAYMWKERAA 
QIAEKDAIKYEQMKRDAEIAKTMMEEHKRIIKEENAAEDKRNKA 
KAQYYLDLEKQLEEQEKKKQEAYEQLLKEKLMIEEIVRKIYEED 
QLEKQQKLEKMNAMRRYIEEFQKEQALWRKKXREEMEEENRKII 
EFANMQQQREEDRMAKVQENEEKRLQLQNALTQKLEEMLRQRED 
LEQVRQELYQ3EQAEXYKSKLKEEAEKKLRKQKEMKQDFEEQMA 
LKELVLQAAK3EEENFRKTMLAKFAEDDRIELMNAQKQRMKQLE 
HRRAVEKLIEERRQQFIAOKQRELEEWQLQQRRQGFINAIIEEE 
RLKLLKEHATNLLGYLPKGVFKKEDDIDLLGEEFRKVYQQRSEI 
CEEK 


6573 


767 


275 


GGGG3ESQSFRAQDGTRTPATDCLMYLQGPRKLMTQGGYDMVQK 
LFLDFFRRRLSQRPTAEELEQRKILKPRNEQEEQEEKREIKRRL 
TRKL32RPTVEELRERKILIRFSDYVEVADAQDYDRRADKPWTR 
LTAADKVSRGECWRVGGRTVCWVSLGSPLGSV 


6574 


204 


1159 


LESSVPVSVGVFWACGVSWTGAAGLQDGALSDTMARNAEKAMTA 
LARFRQAQLEEGKVKERRPFLASE CTELPKAEKWRRQI I SE1S K 
KVAQIQNAGLGEFRIRDLNDEINKLLREKGHWEVRIKELGGPDY 
GKVGPKMLDHEGKEVPGNRGYKYFGAAKDLPGVRELFEKEPLPP 
PRKTRAELMKAIDFEYYGYLDEDDGVIVPLEQEYEKKLRAELVE 
KWKAEREARLARGEKEEEEEEEEEINIYAVTEEESDEEGSQEKG 
GDDSQQKFIAHVPVPSOQEIEEALVRRKICMELLQKYASETLQAQ 
SEEARRLLGY 


6575 


117 




SPAIASQSGGITEEKMLEPQ.ENGVIDLPDYEHVEDETFPPFPPP 
ASPERQDGEGTEPDEESGNGAPVPVPPKRTVKRNIPKLDAQRLI 
SERGLPALRHVFDKAKFKGKGHEAEDLKMLIRHMEHWAHRLFPK 
LQFEDFIDRVEYLGSKKEVQTCLKRIRLDLPILHEDFVSNNDEV 
AENNEHDVTSTELDPFLTNLSESEMFASELSI3LTEEQQQRIER 
NKQLALERRQAKLP 


6576 






PEPQALVGQXRGALRLLVARLVLTVSAPAEVRRRVLRPVLSWMD

RETRALADSHFRGLGVDVPGVGQAPGRVAFVSEPGAFSYADFVR 

GFLLPNLPCVFSSAFTQGWGSRRRWVTPAGRPDFDHLLRTYGDV 

WPVANCGVQEYNSNPKEHMTLRDYITYWKEYIQAGYSSPR3CL 

YLKDWHLCRDFPVEDVFTLPVYFSSDWLNEFWDALDVDDYRFVY 

AGPAGSWSPFHADIFRSFSWSVHVCGRKKWLLFPPGQEEALRDR 

HGNLPYDVTSPALCDTHLHPRNQLAGPPLEITQEAGEMVFVPSG 

WHHQVHNLVMCCFSCPLSGAFLQEDGSTTSPLSQPELGWNGVAH 

G 


6577 


2271 


987 


SDRMASDDFDIVIEAMLEAPYKKEEDEQCRKEVKKDYPSNTTSS 
TSNSGNETSGSSTIGETSNRSRDRDRYRRRNSRSRSPGRQCRHR 
SRSWDRRHGSESRSRDHRREDRVHYRSPPLATGYRYGHSKSPHF 
REKSPVREPVDNLSPEERDARTVFCMQLAARIRPRDLEDFFSAV 
GKVRDVRIISDRNSRRSKGIAYVEFCEIOSVPLAIGLTGQRLLG 
VPI I VQASQAEJOTRIAAMANNLQKGNGGPMRLYVGSLHFNI TED 
MLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFITFSDSECARRAL 
EQLNGFELAGRPMRVGHVTERLDGGTDITFPDGDQELDLGSAGG 
RFQLMAKLAEGAGIQLPSTAAAAAAAAAAQAAALQLNGAVPLGA 
LNPAALTALSPALNLASQCLQLSSLFTPQTM 


6578 


377 




I 1 RAF LiK~ r I _h p p LT A D-S " "1 j j i l i F l LP ^ 
LQIALWSLYWVTSISMVFLNKYLLDSPSliRLDTPIFVTFYQCL 
VTTLLCKGLSALAACCPGAVDFPSLRLDLRVARSVLPLSVVFIG 
MITFWWLCLKYVGVAFYNVGRSLTTVFNVLLSYLLLKQTTSFYA 
LLTCGI I IGGFWLGVDQEGAEGTLSWLGTVFGVLASLCVSLNAI 
YTTKVLPAVDGSIWRLTFYNNVNACILFLPLLLLLG3LQALRDF 
AQLGSAHFWGMMTLGGLFGFAIGYVTGLQIKFTSPLTHNVSGTA 
KACAQTVLAVLYYEETKSFLWWTSNMMVLGGSSAYTWVRGWEMK 
KTPEEPSPKDSEKSAMGV 


6579 


2 


711 


RPPRVWYPELRELSAAAPRWSHRTAPGIMVFYFTSSSVNSSAYT 
IYMGKDKYENEDLIKHGWPEDIWFHVDKLSSAHVYLRLHKGENI 
EDIPKEVLMDCAHLVKAWSIQGCKMMTVWWYTPWSNLKKTADM 
DVGQIGFHRQKDVKIVTVEKKVNEILNRLEKTKVERFPDLAAEK 
ECRDREERNEKKAQIQEMKKREKEEMKKKREMDELRSYSSLMKV 
ENMSSNQDGNDSDEFM 


6580 


62 


1571 


LVALKNWKPKGTNIPAPQSPVFGEAVSGVYMMTKVLGMAPVLGP 
RPPQEQVGPLMVKVEEKEEKGKYLPSLEMFRQRFRQFGYHDTPG 
PREALSQLRVLCCEWLRPEIHTKEQILELLVLEQFLTILPQELQ 
AKVQEHCPESAEEAVTLLEDLERELDEPGHQVSTPPNEQKPVWE 
KISSSGTAKESPSSMQPQPLETSHKYESWGPLYIQESGEEQEFA 
QDPRKVRDCRLSTQHEESADEQKGSEAEGLKGDIISVIIANKPE 
ASLERQCVNLENEKGTKPPLQEAGSKKGRESVPTKPTPGERRYI 
CAEC3KAFSNSSNLTKHRRTHTGEKPYVCTKCGKAFSHSSNLTL 
HYRTHLVDRPYDCKCGKAFGQSSDLLKHQRMHTEEAPYQCKDCG 
KAFS 3KGS L I RH YR I HTSEKP YQ CHE CGKS FSQHAGLS 3HQRLH 
TGEKPYKCKECGKAFNHSSNFNKHHRIHTGEKPYWCHHCGKTFC 
SKSNLSKHQRVHTGEGEAP 


6581 


228 


47S 


RVFLKDLSSTPMASMNTASIAQARKLVEQLKMEANIDRIKVSKA 
AADLMAYCEAHAKEDPLLTPVPASENPFREKKFFCAIL 


6582


142S 


718 


CFTTKTHCSPVSVPYLSPLVLRKELESLLENEGDQVIHTSSFIN 
QHPIIFWTLVWYFRRLDLPSNLPGLILTSEHCNEGVQLPLSSLS 
QDSKLVYIQLLWDNINLHQEPREPLYVSWRNFNSEKKSSLLSEE 
QQETSTLVETIRQSIQHNNVLKPINLLSQQMKP3MKRQRSLYRE 
ILFLSLVSLGRENIDIEAFDNEYGIAYNSLSSEILERLQKIDAP 
PSASVEWCRKCFGAPLI 


6583


487 




RIFSMTSGRLRWRCTWRPATALWSASLRLGTSSMHPSPRSISLP 
LSMMLSPLPSNTRGLSPTALFRSPDSEEATSCPRLKLWRCRAPL 
RSPSPLGRLQVLPKSPLHVHTHKSGKEVLGLQVQRSRSGTGPAC 
SQAGSGAVQGGNWC I F 


6584




1750 




PLPK.AALGPSSQNVTEYWRVPIMTTKKYNIMAFNAADKVNFAT 
WNQARLEKDLSNKKIYQEEEMPESGAGSEFNRKLREEARRKKYG 
IVLKF.FRPEDQPWT.T.RVICGKSGRKFKGIKKGGVTENTSYYIFTQ 
CPDGAFEAFPVHNWYNFTPLARHRTLTAEEAEEEWERRIIKVLNH 
FSIMQQRRLKDQDQDEDEEEKEKRGRRKASELRIHDT.EDDLEMS 
SDASDASGEEGGRVPKAKKKAPLAKGGRKKKKKXGSDDEAFEDS 
DDGDFEGQEVDYMSDGSSSSQEEPESKAKAPQQEEGPKGVDEQS 
DSSEESEEEKPPEEDKEEEEEKKAPTPQEKKRRKDSSEESDSSE 
ESDIDSEASSAFFMAKKKTPPKRERKPSCCSSRGNSRPGTPSAE 
GGSTSSTLRAAASKLEQGKRVSEMPAAKRLRLDTGPQSLSGKST 
PQPPSGKTTPNSGDVQVTEDAVRRYLTRKPMTTKDLLKKFQTKK 
TGLSSEQTVNVLAQILKRLNPERKMINDKNHFSLKE 


6585 


3 




GPIRNSRIDDFVGGDPRAEASCSVLHSKPHAMADSRDPASDQMQ 
HWKEQRAAQKADVLTTGAGNPVGDKLNVITVGPRGPLLVQDWF 
TDEMAHFDRER I PERWHAKGAGAFGYFEVTHD ITKYS KAKVFE 
HIGKKTPIAVRFSTVAGESGSADTVRDPRGFAVKFYTEDGNWDL 
VGNNTP1FFIRDPILFPSFIHSQKRNPQTHLKDPDMVWDFWSLR 
PESLHQVSFLFSDRGIPDGHRHMNGYGSHTFKLVNANGEAVYCK 
FHYKTDQGIKWLSVEDAARLSQEDPDYGIRDLFIIAIATGKYPSW 
TFYIQVMTFNQAETFPFNPFDLTKVWPHKDYPLIPVGKLVLWRN 
PVNYFAE VEQ IAFDPSNMP PG IEAS PDKMLQGRLFAYPDTHRHR 
LGPNYLHIPVNCPYRARVANYQRDGPMCMQDNQGGAPNYYPHSF 
GAPEQQPSALEHSIQYSGEVRRFNTANDDNVTQVRAFYVNVLNE 
EQRKRLCENIAGHLKDAQIFIQKKAVKNFTEVHPDYGSHIQALL 
DKYNAEKPKNAIHTFVQSGSHLAAREKANL 


6586


32 


804 


PLPEQPASSTSTMPVSGTPAPNKKRKSSKLIMELTGGGQESSGL 
NLGKKISVPRDVMLEELSLLTNRGSKMFKLRQMRVEKFIYEWHP 
DVFSDSSMDHFQKFLPTVGGQLGTAGQGFSYSKSNGRGGSQAGG 
SGSAGQYGSDQQHHLGSGSGAGGTGGPAGQAGRGGAAGTAGVGE 
TGSGDQAGGEGKHITVFKTYISPWERAMGVDPQQKMELGIDLLA 
YGAKAELPKYKS FNRTAMP YGGYEKAS KRMTFQMPKV 


6587


75 


1117 


P.RVFSLGKMPECWDGEHDIETPYGLLHWIR3SPKGNRPAILTY 
HDVGLNHKLCFNT F FNFEDMQE I TKHF WCHVDAPGQQVGAS Q F 
PQGYCFPSMEQLAAMLPSWQHFGFKYVIG1GVGAGAYVLAKFA 
LIFPELVEGLVLWIDPNGKGWIDWAATKLSGLTSTLPDTVLSH 
LFSQEELVNNTELVQSYRQQIGNWNQANLQLFKNMYNSRRDLD 
INRPGTVPNAKTLRCPVMLWGDNAPAEDGWECNSKLDPTTTT 
FLKMADSGGLPQVTQPGKLTEAFKYFLQGMGYMPSASMTRLARS 
RTASLTSASSVDGSRPQACTHSESSEGLGQVNHTMEVSC 


6588


137


501 


LGLQAQLLELRTNNYQLSDELRKNGVELTSLRQKVAYLDKBFSK 
AQKALSKSKKAQEVEVLLSENEMLQAKLHSQEEDFRLQNSTLMA 
EFSKLCSQMEQLEQENQQLKEGAAGAGVAQAGP 


6589 


2 


140 r > 


RPWGSAMATFSRQEFFQQLLQGCLLPTAQQGLDQIWLLLAICLA 
CRLLWRLGLPSYLKHASTVAGGFFSLYHFFQLHMVWVVLLSLLC 
YLVL FL CRHS SHRG VFLS VT I L I YLLMGEMHMVDTVTWHKKRGA 
QMIVAMKAVSLGFDLDRGEVGTVPSPVEFMGYLYFVGTIVFGPW 
ISFHSYLQAVQGRPLSCRWLQKVARSLALALLCLVLSTCVGPYL 
FP YF I PLNGDRLLRNKKRKARGTMVRWLRAYESAVS FHFSNYFV 
GFLSEATATLAGAGFTESKDHLEWDLTVSKPLNVELPRSMVEW 
TSWNLPMSYWLNNYVFKNALRLGTFSAVLVTYAASALLHGFSFH 
LAAVLLSLAFITYVEHVLRKRLARILSACVLSKRCPPDCSHQHR 
LGLGVRALNLLFGALAIFHLAYLGSLFDVDVDDTTEEQGYGMAY 
ILSWASHWVTFGCWIFYRLIG 


6590


2177 


656 


VRAYEHVLSLLENVFTPMFCHRDEYFRQLLRGAESPTRNSKLNR 
GSLSLDDFRNTQKRGESFGISRIGSKIKGVFKSTTMEGAMLPNY 
GVAEGEDDFIEEGIWMEDDSPVEAVSTPNTPRNLAAWKISIPY 
VDFFEDPSSERKEKKERIPVFCIDVERNDRRATCHEPEHWSVYR 
RYLEFYVLESKLTEFHGAFPDAQLPSKRIIGPKNYEFLKSKREE 
FQHYLQKLLQHPELSNSQLLADFLSPNGGETQFLDKILPDVNLG 
KIIKSVPGKLMKEKGQHLEPFIMNFINSCESPKPKPSRPELTIL 
SPTSENNKKLFNDLFKHNANRAENTERKQNQNYFMEVMTVEHVY 
DYLMYVGRWFQVPDWLHHLLMGTRILFKNTLEMYTDYYLQCKL 
EQLFQEHRLVSLITLLRDAIFCENTEPRSLQDKQKGAKQTFEEM 
MNYIPDLLVKCIGEETKYESIRLLFIX3LQQPVLNKQLTYVLLDI 
VIQELFPELNKVQKEVTSVTSWM 


6591 


2177 


656 


VRAYEHVLSLLENVFTPMFCHRDEYFRCLLRGAESPTRNSKLNR 
GSLSLDDFRNTQKRGESFGISRIGSKIKGVFKSTTMEGAMLPNY 
GVAEGEDDFI EEG IWMEDDSPVEAVSTPNTPRNLAAWKI S IPY 
VDFFEDPSSERKEKKERIPVFCIDVERNDRRAVGHEPEHWSVYR 
RYLEFYVLESKLTEFHGAFPDAQLPSKRIIGPKNYEFLKSKREE 
FQEYLQKLLQHPELSNSQLLADFLSPNGGETQFLDKTLPDVNLG 
KIIKSVPGKLMKEKGQHLEPFIMNFINSCESPKPKPSRPELTIL 
SPTSENNKKLFNDLFKNNANRAENTERKQNQNYFMEVMTVEGVY 
DYLMYVGRWFQVPDWLHHLLMGTRILFKNTLEMYTDYYLQCKL 
EQLFQEHRLVSLITLLRDAIFCENTEPRSLQDKQKGAKQTF3EM 
MNYIPDLLVKCIGEETKYESIRLLFDGLQQPVLNKQLTYVLLDI 
VIQELFPELNKVQKEVTSVTSWM 


6592 


3 


1861 


APEFLGSTISSGSMIDANLKLLQEAEQRLKAIVAEKFAIATKEG 
DLPQVERFFKIFPLLGLHEEGLRKFSEYLCKQVASKAEENLLMV 
LGTDMSDRRAAVIFADTLTLLFEGIARIVETHQPIVETYYGPGR 
LYTL I KYLQVECDRQVEKWDKF IKQRDYHQQFRHVQNNLMRNS 
TTEKIEPRELDPILTEVTLMNARSELYLRFLKKRISSDFEVGDS 
MASEEVKQEHQKCLDKLLNNCLLSCTMQELIGLYVTMEEYFMRE 
TVNKAVALDTYEKGQLTS SMVDDVFYI VKKC IGRALSSSS IDCL 
CAMINLATTELESDFRDVLCNKLRMGFPATTFQDIQRGVTSAVN 
IMHSSLQQGKFDTKGIESTDEAKMSFLVTLNNVEVCSENISTLK 
KTLEEDCTICLFSQGIGGEQAQAKFDSCLSDLAAVSNKFRDLLQE 
GLTELNSTAIKPQVQPWINSFFSVSHNI3EEEFNDYEANDPWVQ 
QFILNLEQQMAEFKASLSPVIYDSLTGLMTSLVAVELEKWLKS 
TFNRLGCLQFDKELRSLIAYLTTVTTWTIRDKFARLSQMATILN 
LERVTEILDYWGPNSGPLTWRLTPAEVRQVLALRIDFRSEDIKR 
LRL 


6593 




1837 


ISRRRGLALQRGVLGGLGGYCPCCCRRRGRLLVLLLLVR 
RGGEGGGGRGRGDKRRRRQARRQRRRPEPAEARGGKMADVLSVL 
RQYNIQKKEIWKGD3VIFGEFSWPKNVKTNYWWGTGKEGQPR 
E n LDSI-iFLLNNVHLSHPVYVRRAATENIPWRRPDRKDLLG 
YLNGEAS TSAS IDRS APLE I GLQRSTQVKRAADEVLAEAKKPRI 
EDEECVRLDKERLAARLEGHKEGIVQTEQIRSLSEAMSVEKIAA 
IKAKIMAKKRSTIKTDLDDDITALKQRSFVDAEVDVTRDIVSRE 
RVWRTRTTILQSTGKNFSKNIFAILQSVKAHEEGRAPEQRPAPN 
AAPVDPTLRTKQPIPAAYNRYDQERFKGKEETEGFKIDTMC-TYH 
GMTLKSVTEGASARKTQTPAAQPVPRPVSQARPPPMQKKGSRTP 
IIIIPAATTSLITMLNAKDLLQDLKFVPSDEKKKQGCQRENETL 
IQRRKDQMQPGGTAISVTVPYRWDQPLKLMPQDWDRWAVFVQ 
GPAWQFKGWPWLLPDGS PVD I FAKI KAFHLKYDEVRLDPNVQKW 
DVTVLELSYHKRHLDRPVFLRVWETLDRYMVKHKSHLRF 


6594




1096 


EFPGRRFRGSQASPLCATCGPALLRAPTRAAMTRSLFKGNFWSA 
DILSTIGYDNIIQHLNNGRKNCKEFEDFLKERAAIEERYGKDLL 
NLSRKKPCGQSEINTLKRALEVFKQQVDNVAQCHIQLAQSLREE 
ARKMEEFREKQKLQRKKTELIMDAIHKQKSLQFKKTMDAKKNYE 
QKCRDKDEAEQAVSRSANLWPKQQEKLFVKLATSKTAVEDSDK 
AYMLHIGTLDKVREEWQSEHIKACEAFliAOECERINFFRNALWL 
HVNQLSQQCVTSDEMYEQVRKSLEMCSIQRDIEYFVNQRKTGQI 
PPAPIMYENFYSSQKNAVPAGKATGPNLARRGPLPIPKSSPDDP 
L IT \ SliLYQ 


6595


57 


781 


1 LGT1 'JE5DL33DEGLLSLAGKRKRRGNI.PKESVKILRDWLYLH 
RYNAYPSEQEKLSLSGQTNLSVLQ I CNWFINARRRLLPDMLRKD 
GKDPNQFTISRRGGKASDVALPRGSSPSVIAVSVPAPTIIVLSLS 
VCSMPLHSGQGEKPAAPFPR3ELESPKPLVTPGSTLTLLTRAEA 
GSPTGGLFNTPPPTPPEQDKEDFSSFQLLVEVAIjQRAAEMELQK 
QQDPSLPLLHTP I PLVSENP3 


6596 


2 


1026


PRLPVRRYHGRRRLQGRSRGHMAEGDAGSDQRQNEEIEAMAAIY 
GEEWCVIDDCAKIFCIRISDDIDDPKWTLCLQVMLPNEYPGTAP 
PIYQLNAPWLKGQERADLSNSLEEIYIQNIGESILYLWVEKIRD 
VLIQKSQMTEPGPDVKKKTEEEDVECEDDLILACQPESSVKALD 
FDISETRTEVEVEELPPIDHGIPITDRRSTFQAHLAPWCPKQV 
KMVLSKLYENKKIASATHNIYAYRIYCEDKQTFLQDCEDDGETA 
AGGRLLHLMEILNVKNVMVWSRWYGGILLGPDRFKHINNCARN 
ILVEKNYTNSPEESSKALGKNKKVRKDKKRNEH 


6597 


2 


1026 


PRLPVRRYHGRRRLQGRSRGHMAEGDAGSDQRQ1IEEIEAMAAIY 
GEEWCVIDDCAKIFCIRISDDIDDPKWTLCLQVMLPNEYPGTAP 
PIYQLNAPWLKGQERADLSNSLEEIYIQNIGESILYLWVEKIRD 
VLIQKSQMTEPGPDVKKKTEEEDVECEDDLILACQPESSVKALD 
FDISETRTEVEVEELPPIDHGIPITDRRSTFQAHLAPWCPKQV 
ICMVLSKLYENKKIASATKNIYAYR1YCEDKQTFLQDCEEDGETA 
AGGRLLHLMEILnVKTTVMVWSRWYGGILLGPDRFKHIHNCARN 
ILVEKNYTNSPEESSKALGKNKKVRKDKKRNEH 


6598 


1099 


419 


PRVRWATTMAMSFEWPWQYRFPPFFTLQPNVDTRQKQLAAWCSL 
VLSFCRLHKQSSMTVMEAQESPLFNNVKLQRKLPVESIQIVLEE 
LRKKGNLEWLDKSKSSFLIMWRRPEBWGKLIYQWVSRSGQNNSV 
FTLYELTNGEDTEDEEFHGLDEATLLRALQALQQEHKAEIITVS 
DGPRRQVLLAGTCLPLLLTSHLSRAFKRRQTQCPPKTGSVTPPD 
SKGLQS 


6599 


164 


1593 


I MAi -jTTLFKYIDENQDRYIKKLAKKVAIQSVSAWPEKRGEIRR 
HMEVP.AADVKQLGGSVELVDIGKQKLPDGSEIPLEPILLGRLGS 
DPQKKTVCIYGHLDVQPAALEDGWDSEPFTLVERDGKLHGRGST 
DDK3FVAGVJINALEAYQKTGQEIPVNVRFCLEGKEESGSEGIiDE 
LIFARKDTFFKDVDYVCISDNYWLGKKKPCITYGLRGICYFFIE 
VE CSNKDLHSGVYGGSVHEAMTDL ILLMGSLVDKRG1II LI PGIN 
EAVAAVTEEEHKLYDDIDFDIEEFAKDVGAQILLHSHKKDILMH 
RWRYPSLSLHGIEGAFSGSGAKTVIPRKWGKFSIRLVPNMTPE 
WGEQVTSYLTKKFAELRSPNEFKVYMGHGGKPWVSDFSHPHYL 
AGRRAMKTVFGVEPDLTREGGS1PVTLTFQEATGKNVMLLPVGS 
ADDGAHSQNEKLNRYNYIEGTKMLAAYLYEVSQLKD 


6600


2 


934 


PGRLFRVAAMESAGLEQLLRELLLPDTERIRRATEQLQ_VLRAP 
AALSALCDLLASAADPQIRQFAAVLTRRRLMTRWRRLAAEQRES 
LKSLILTALQRETEHCVSLSLAQLSATIFRKEGLEAWPQLLQLL 
QltSTIISPIISPEREMGLLLLSVWTSRPEAFQPHHRELLRLLNET 
LGEVGSPGLLFYSLRTLTTMAPYLSTEDVPLARMLVPKLIMAMQ 
TL1PIDEAKACEALEALDELLESEVPVITPYLSEVLTFCLEVAR 
NVALGNAIRIRILCCLTFLVKVKSKALLKNRLLATLAAHPFPHC 
GC 


6601


529


142C 


PRAAARAPPPAVLRRDRRAATAPGAGEMTLHGPLAQRYFLNHIE 
KITTWQDPRKAMNQPLNHMNLHPAVSSTPVPQRSMAVSQPNLVM 
NHQHQQQMAPSTLSQQNHPT2NPPAGLMSMPNALTTQQQQQQKL 
RLQRIQMERERIRMRQEELMRQEAALCRQLPMEAETLAPVQAAV 
NPPTMTPDMRSITNNSSDPFLNGGPYHSREQSTDSGLGLGCYSV 
PTTPEDFLSNVDEMDTGEWAGQTPMNINPQQTRFPDFLDCLPGT 
NVDLGTLESEDLIPLFNDVESALNKSEPFLTWL 


6602 


127 


617 


LLDFPALPKFVIAQSPKAGKPSTMTSMTQSLREVIKAMTKAKNF 
ERVLGKITLVSAAPGKVICEMKVEEEIITNAIGTLHGGLTATLVD 
NISTMALLCTERGAPGVSVDMNITYMSPAKLGEDIVIIAHVLKQ 
GKTLAFTSVDLTNKATGKL IAQGRHTKHLGK 


6603 




660 


EVGPSSLAARTGLGHLPFLHRLASSRGI.DMDLLQFLAFT.FVrj.T, 
SGMGATGTLRTSLDPSLEIYKKMFEVKRREC1LALKNLAQLNDI 
HQQYK1LDVMLKGLFKVLEDSRTVLTAADVLPDGPFPQDEKLKD 
AFSHWENTAFFGDVVLRFPRIVHYYFDHNSNWNLLIRWGISFC 
NQTGVFNQG PHSPILSLM 




3 


688 


TSTAQRQGGERMSFRGGGRGGFNRGGGGGGFNRGGSSNHFRGGG 
GGGGGGNFRGGGRGGFGRGGGRGGFNKGQDQGPPERWLLGEFL 
HPCEDDIVCKCTTDENKVPYFNAPVYLENKEQIGKVDEIFGQLR 
DFYFSVKLSEWMKASSFKKLQKFYIDPYKLLPLQRFLPRPPGEK 
GPPRGGGRGC-RGGGRGGGGRGGGRGGGFRGGRGGGGGGFRGGRG 
GGFRGRGH 


6605 


7 


848 


SGSRRGAMRAAGVGLVDCHCHLSAPDFDRDLDDVLEKAKKAWW 
ALVAVAEHSGEFEKIMQLSERYNGFVLPCLGVHPVQGLPPEDQR 
SVTLKDLDVALPIIENYKDRLLAIGEVGLDFSPRFAGTGEQKEE 
QRQVLIRQICLAKRLMLP\ r NVHSRSAGRPTINIiLQEQGAEKVLL 
HAFDGRPS VAMEGVRAGYFFS IPPS 1 1 RSGQQKLVKQLPLTS I C 
LETDSPALGPEKQVRNEPWNISISAEYIAQVKGISVEEVIEVTT 
QNALKLFPKLRHLLQK 


6606 


2 


1682 


FVE I P. PRAEVANLSAHSAS PI QDAVLKRLSLLED IVYRQLNGLS 
KS LGL : EG YGGRGKGGLPATLS PAEEE KAKBPHEKYGYNS YLS E 
ia SLDRS I PDYRPTKCKELKYS KDLPQ IS 1 1 FI FWEALSVI LR 
SVHSAWHTPTHLLKEIILVDDNSDEEELKVPLEEYVHKRYPGL 
VKWRNQKREGL I RAR I EGWKVATGQVTGFFDAHVE FTAGWAE P 
VLSR I QENRKRVI LPS IDNIKQDNFEVQRYENSAHGYSWELWCM 
YISPPKDWWDAGDPSLPIRTPAMIGCSFWNRKFFGEIGLLDPG 
MDVYGGENIEIGIKVWLCCGSMEVLPCSRVAHIERKKKPYNSNI 
Op-fTXRNALRVAEVWMDDYKSHVYIAWNLPLENPGIDIGDVSER 
RALRKSLKCKNFQWYLDHVYPEMRRYNNTVAYGELRIINKAKDVC 
LDQGPLENHTAILYPCHGWGPQLARYTKEGFLHLGALGTTTLLP 
DT7C'_VDIJSKSRLPQLLDCDKVKSSLYKRWNFIQNGAIMNKGTG 
RCLEVENRGLAGIDLILRSCTGQRWTIKNSIK 


6607 


137 


986 


VPACAGLKKSARSLLASPPRLLNTKLQASCRALFSPPIQSRQTT 
GISFQGRGGAGPGVPTRTQVFAAMGAVMGTFSSLQTKQRRPSKD 
KISDELEMTMVCHRPEGLEQLEAQTNFTKRELQVLYRGFKNECP 
SC-VVNEDTFKQIYAQFFPHGDASTYAHYLFNAFDTTQTGSVKFE 
DFVTALSILLRGTVHEKLRWTFNLYDINKDGYINQEEMMDIVKA 
IYDMMGKYTYPVLKEDTPRQHVDVFFQKMDKNKDGIVTLDEFLE 
SCQEDDNIMRSLQLFQNVM 


6608


224 




RPCFSSPTGLCPRLSYPMILLQHAVLPPPKQESPSPPMSVATRS 
TGTLQLPPQKPFGQEASLPIiAGEEELSKGGEQDCALEELCKPLY 
CKLCOTTLNSAQQAQAHYQGKNHGKKLRNYYAAMSCPPPARMSN 
WEPAATP WPVP PQMGS FKPGC-RVI LATEND YCKLCDAS FSS P 
AVAQAHYQGKNHAKRLRLAEAQSNSFSESSELGQRRARKEGNEF 
KMMPIIRRNMYTVQNNSGPYFNPRSRQRIPRDLAMCVTPSGQFYC 
SMCNVGAGEEMEFRQHLESKQHKSKVSEQRYRNEMENLGYV 


6609 






FRLRCRRFRVAGGRLAGAGLRESRVPAPEQRLSALTLLSWSAVT 
PAAEPGNFQLSPAEPRGPLA3PVRAAPRAPCPAAEMSELNTKTS 
PATNQAAGQEEKGKAGNVKKAEEEEEIDIDLTAPETEKAALAIC 
GKFRRFQKRKKDPSS 


6610 


319 


881 


GRKSLCNLHIFIRFPLTYPDMYMGMMCTAKKCG2RFQPPAIILI 
YESEIKGKIRQRIMPVRNFSKFSDCTRA^EQLKIJHPRHKSYLEQ 
VSLRQLEKLFS FLRGYLSGQS LAETMEQ I QRETTI DPEEDLNKL 
DDKEIAKRKSIMDELFEKNQXKKDDPNFVYDIEVEFPQDDQLQS 
CGWDTESADEF 


6611


I ,» 


212 


PGCSGAGSRVWWLPALRHLAMGSTESSEGRRVSFGVnEEERVRV 
LQGVRLSENVYHRMKEPSSPPPAPTSSTFGLQDGHLRAPHKEST 
LPRSGSSGGQQPSGMKEGVKRYEQEHAAIQDKLFQVAKREREAA 
TKII5KASLPTGEGSISHEEQKSVRLARELESREAELRRRDTFYK 
EQLERIERKNAEMYKLSSEQFHEAASKMESTIKPRRVEPVCSGL 
QAQ I LHC YRDR PHE VLL CS DLVKA YQR C VSAAHKG 


6612 




992 


VSTHASALSRTQGQPDRQPRAAASGAGAGTAGGGGSGGAEGSKM 
STEAQRVDDSPSTSGGSSDGDQRESVQQEPEREQVQPKKKEGKI 
SSKTAAKLSTSAKRI3KELAEITLDPPPNCSAGPKGDNIYEWRS 
TILGPPGSVYEGGVFFLDITFSPDYPFKPPKVTFRTRIYHCNIN 
SQGVI CLDILKDNWSPALTISKVLLS I CSLLTDCNPADPLVGS I 
ATQYMTNRAEHDRMARQWTKRYAT 


6613


130 


748 


ELELSSNMPEQSNDYRVAVFGAGGVGKSSL.VLRFVKGTFRESYI 
PTVEDTYRQVISCDKSICTLQITDTTGSHQFPAMQRLSISKGHA 
FILVYSITSRQSLEELKPIYEQICEIKGDVESIPIMLVGNKCDE 
SPSREVQSSEAEALARTWKCAFMETSAKLKTHNVKELFQELLNLE 
KRRTVS LQIDGKKS KQQKRKEKLKGKCVIM 


6614


3 


1191 


SSAAEAMRVLVRRCWGPPLAHGARRGRPSPQWRALARLGWEDCR 
DSRVREKPPWRVLFFGTDQFAREALRALHAARENKEEEIjIDKLE 
WTMPSPSPKGLPVKQYAVQSQLPVYEWPDVGSGEYDVGVVASF 
GRLLNEALILKFPYGHJJVHPSCLPRWRGPAPVIHTVLHGDTVT 
GVTIKQIRPKRFDVGPILKQETVPVPPKSTAKELEAVLSRLGAN 
MLISVLKNLPESLSNGRQQPMEGATYAPKISAGTSCIKWEEQTS 
EQIFRLYRAIGNIIPLQTLWMANT1KLLDLV3VNSSVIADPKLT 
GQALIPGSVIYHKQSQILLVYCKDGWIGVRSVMLKKSLTATDFY 
NGYLHPWYQKNSQAQPSQCRFQTLRLPTKKKQKKTVAMQOCIE 


6615 


832 


35 


G-wGAGASAMSELPGDVRAFLREHPSLRLQTDARKVRCILTGHE 
LPCRLPELQVYTRGKKYQRLVRASPAFDYAEFEPHIVPSTKNPH 
QLFCKLTLRHINKCPEHVLRHTQGRRYQRALCKYEECQKQGVEY 
VPACLVHRRRRREDQMDGDGPRPREAFWEPTSSDEGGAASDDSM 
TDLYPPELFTRKDLGSTEDGDGTDDFLTDKEDEKAKPPREKATD 
EGRRETTVYRGLVQKRGKKQLGSLKKKFKSHHRKPKSF33CKQS 


6616 


347 


1886 


LLPPCQGARPLSSPPHASEDNLFLFWNCILCAFPHPSPQPLQYP 
WPLLLVITQIPAPRHLRNRPFSFSRGGLDSFSGSLSTPSICRS 
PAWVKMAPWPPKGLVPAVLWGLSLFLNLPGPIWLQPSPPPQSSP
PPQPHPCHTCRGLVDSFNKGLERTIRDNFGGGNTAWEEENLSKY
KDSETRLVEVLEGVCSKSDFECHRLLELSEELVESWWFHKQQEA
PDLFQWLCSDSLKLCCPAGTFGPSCLPCPGGTERPCGGYGQCEG
EGTRGGSGHCDCQAGYGGEACGQCGLGYFEAERNASHLVCSACF
GPCARCSGPEESNCLQCKKGWALHHLKCVDIDECGTEGANCGAD
QFCVNTEGSYECRDCAKACLGCMGAGPGRCKKCSPGYQQVGSKC
LDVDECETEVCPGENKQCENTEGGYRCICAEGYKQMEGICVKEQ
IPESAGFFSEMTEDELWLQQMFFGII ICALATLAAKGDLVFTA
IFIGAVAAMTGYWLSERSDRVLEGFIKGR

VWMAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVS 
LSYHLDTKVRCPMCWQAVDGSSSLPNVSLAWVIEALRLP3DPEP 
KVCVHHRNPLSLFCEKDQEL I CGLCGLLGSHQHHPVTP I STVCS 
RMKEELAALFSELKQEQKKVDELIAKLVKNRTRIDGSAPSLCPC 
LGPATFTFL 

DGKVARRAPNSPAFQNDIYPLVSAPRATTAESPWSKVLQNTQCR 
WPKMTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPE 
AFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEA 

WKYGT 

PASSEVLTAAVMFLLLNCIVAVSQNMGTGKNGDLPRPPLRNEFR 
YFQRMTTTSSVEGKQNLVIMGRKTWFSIPEKNRPLKDRINLVLS 
RELKKPPQGAHFLARSLDDALKLTERPELANKVDMIWIVGGSSV 
YKEAMNHLGHLKLFVTRIMQDFESDTFFSEIDLEKYKLLPEYPG 
ILSDVQEGKHIKYKFEVCEKDD 



NSRVDDFVARARMAAENEASQESALGAYSPVDYMSITSFPRLPE 
DEPAPAAPLRGRKDEDAFLGDPDTDPDSFLKSARLQRLP3SSSE 
MGSQDGSPLRETRKDPFSAAAAECSCRQDGLTVIVTACLTFATG 
VTVALVMQIYFGDPQIFQQGAWTDAARCTSLGIEVLSKQGSSV 
DAAVAAALCLGIVAPHSSGLGGGGVMLVHDIRRNESHLIDFRES 
APGALREETLQRSWETKPGLLVGVPGMVKGLHEAHQLYGRLPWS 
QVLAFAAAVAQDGFNVTHDLARALAEQLPPKMSERFRETFLPSG 
RPPLPGSLLHRPDLAEVLDVLGTSGPAAFYAGGNLTLEMVAEAQ 
IIAGGVITEEDFSNYSALVEKPVCGVYRGHLVLSPPPPHTGPALI 
SALNILEGFNLTSLVSREQALHWVAETLKIALALASRLGDPVYD 
STITESMDDMLSKVEAAYLRGHINDSQAAPAPLLPVYELDGAPT 
AAQVLIMGPDDFIVAMVSSLNQPFGSGLITPSGILLHSQMLDFS 
WPNRTANHSAPSLENSVQPGKRPLSFLLPTWRPAEGLCGTYUA 
LGANGAARGLSGLTQVRFTPWLAFFSREPSCGLDCRCLSYLWLV 
SIPHAANMG 

VQGITSYQQRLQALRKEKSRDAARSRRGF.ENF3FYELAKLLPLP 
AAITSQLDKASIIRLTISYLKMRDFANQGDPPWNIjRMEGPPPNT 
SVKVIGAQRRRSPSALAIEVFEAHLGSHILQSLDGYVFALNQEG 
KFLYISETVSIYLGLSQVELTGSSVFDYVHPGDHVEMAEQLGMK 
LPPGRGIiLSQGTAEDGASSASSSSQSETPEPWCFPPASDQFLL 
GRASGAQEETEAGGPERARAMEANMPKRKEPGRSLRIKVISMGN 
AEVGKSCIIKRYCEKRFVSKYLATIGIDYGVTKVHVRDREIKVN 
IFDMAGHPFFYEVRKPF 

KALFEKVKKFRLHVEEGDILYAMYVRQTVLKVIKFLIIIAYNSA 
LVSKVQFTVDCNVDIQDMTGYKNFSCNHTMAHLFSKLSFCYLCF 
VSIYGLTCLYTLYWLFYRSLREYSFEYVRQETGFDDIPDVKNDF 
AFMLHMIDQYDPLYSKRFAVFLSEVSENKLKQLNLNNEWTPDKL 
RQKLQTNAHNRLELPLIMLSGLPDTVFEITELQSLKLEIIKNVM 
I PATI AQLDNLQELS LHQCSVKIHSAALS FLKENLKVLS VKFDD 
MRELPPWMYGLRNLEELYLVGSLSHDISRNVTLESLRDLKSLKI 
LSIKSNVSKIPQAWDVSSHLQKMCIHNDGTKLVMLNNLKKMTN 
LTELELVHCDLERIPHAVFSLLSLQELDLKENNIjKSIEEIVSFQ 
HLRKLTVLKLWHNSITYIPEHIKKLTSLERLSFSHNKIEVLPSH
DELYFCKKLKTLKIGKNSLSVLSPKIGNLLFLSYLDGKGNHFEI
LPPELGDCRALKRAGLWEDALFETLPSDVREQKKTE


6624 


218 


1786 


GSRRGGGSRIPAVSTHVAPGRSVLRPFASGALRLRSLVKALGGC 
RGRPSGLAHLSQETSHWRAKRSGRACLGDFPGEILRSFIMKCTA 
REWLiRVTTVLFMARAIPAMWPNATLLEKLLEKYMDEDGEWWIA 
KQRGKRAITDNDMQSILDLHNKLRSQVYPTASNMEYMTWDVELE 
RSAESWAESCLWEHGPASLLPSIGQNLGAHWGRYRPPTFHVQSW 
YDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQWWATSNRIGC 
AINLCHNMNIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCS 
ACPPSFGGGCRENLCYKEGSDRYYPPREEETNEIERQQSQVHDT 
HVRTRSDDSSRNEVISAQQMSQIVS CEVRLRDQCKGTTCNRYEC 
PAGCLDSKAKVIGSVHYEMQSSICRAAIHYGIIDNDGGWVDITR 
QGRKHYFIKSNRNGIQTIGKYQSANSFTVSKVTVQAVTCETTVE 
QLCPFHKPASHCPRVYCPRKLYASKSTLCSCNWNSSLF 


6625 


1124 


543 


PG?RGGGGSLLSTKALGRSRGLCriH r 3PS 3 3 3 ! "vpTALRPP 
GPLVPSTSDDNLLKNIELFDKLALRFHGRLLFLKDVLGDEICCW 

YETPRGFDPALLEATGGAAGAGGAGRGEDEENREHRVRRII-IVRR 
HITHDERPHGQQIVFKD 


6626


3 


1498 


SAVEFVYTDRFHLILGISVEFLCSLRSDATMESITACLHALQAL 
LDVPWPRSKIGSDQDSGIELLNVLHRVILTRESPSIQLASLEW 
RQII CAAQEHVKEKRRSAEVDDGAAEKETLPE?GEGKDTGGLVP 
GKSLVFATLELCVCILVRQLPELNPKLTGSPGVKATKPQILLED 
GSRLVSAALVILSELPAVCSPEGSTSILPTILYLTIGVLRETAV 
KLPGGQLSSTVAASLQALKGILSSPMARAEKSRTAWTDLLRSAL 
TT I LDCWDPVDETHQELDE VSLLTAI TVF ILS TS PEVTT I P CLQ 
KRCIDKFKATLEIKDPWQIKTYQLLHSIFQYPNPAVSYPYIYS 
LASCIMEKLQEIDKRKPENTAELEIFQEC-IIWLETLVTVAEEHH 
RAQLVACLLP I L I SFLLDENSLGSATSIKRNLHDFALQNLMQIG 
PQYSSVFKSLVASSPALKARLEAAIKGNQESVKVKIPTSKYTKS 
PGKNSSIQLKTSFL 


6627 


1 


697 


GIPHLSSRDMTGTPGAVATRDGEAPERSPPCSPSYDLTGKVMLL 
GDTGVGKTCFLIQFKDGAFLSGTFIATVGIDFRNKWTVDGVRV 
KLQ I WDTAGQE RFRS VTHAY YRDAQALLLLYD I TNKS S FDN I RA 
WLTEIHEYAQRDWIMLLGNKADMSSERVIRSEDGETLAREYGV 
PFLETSAKTGMNVELAFLAIAKELKYRAGHQADEPSFQIRDYVE 
SQKKRSSCCSFM 








KEFGDSLSLEILQIIKESQQQHGLRHGDFQRYRGYCSRRQRRLR 
KTLNFKMGNRHKFTGKKVTEELLTDNRYLLLVLMDAE RAWS YAM 
QLKQEANTEPRKRFHLLSRLRKAVKHAEELERLCESNRVDAKTK 
LEAOAYTAYLSGMLRFEHQEWKAAIEAFNKCKTIYEKLASAFTE 
EQAVLYNQRVEEISPNIRYCAYNIGDQSAINELMQMRLRSGGTE 
GLLAEKLEALITQTRAKQAATMSEVEWRGRTVPVKIDKVRIFLL 
GLADNEAAIVQAESEETKERLFESMLSECRDAIQWREELKPDQ 
KQRD YI LEGEPGKVSNLQYLHS YLTYI KLSTAI KRNENMAKGLQ 
RALLQQQPEDDSKRSPRPQDLIRLYDIILQNLVELLQLPGLEED 
KAFQKEIGLKTLVFKAYRCFFIAQSYVLVKKWSEALVLYDRVLK 
YANEVNSDAGAFKNSLKDLPDVQELITQVRSEKCSLQAAAILDA 
NDAHQTETSSSQVKDNKPLVERFETFCLDPSLVTKQANLVHFPP 
GFQPIPCKPLFFDIALNHVAFPPLEDKLEQKTKSGLTGYIKGIF 
GFRS 


6629 


5653 


4549 


-1 1 L jSVGGRTGKMDAATLTYDTLRFAEFEDFPETSEPVWILG 
RKYSIFTEKDEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGC 
ILRi 'JJMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDR 
KDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWS 
SIAVHIAMDNTWMEEIHRLCHTSVPCAGATAFPADSDRHCNGF 
PAGAEVTNRPS PWRPLVLLI PLRLGLTDINEAYVETLKHCFMMP 
QSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFI 
PDESFHCQHPPCRMSIAELDPSIAVVRGGHLSTQAFGAECCLGM 
TRKTFGFLRFFFSMLG 


6630




423 


LVQCGGIRRRSAWGAMPGRHVSRVEALYKRVLQLHRVLPPDLKS 
LGDQYVKDEFRRHKTVGSDEAQRFLQEWEVYATALLQQANENRQ 
NSTGKACFGTFLPEEKLMDFRDEQIGQLQELMQEATKPNRQFSI
SESMKPKF 


6631






LVQCGGIRRRSAWGAMPGRHVSRVRALYKRVLQLHRVLPPDLKS 
LGDQYVKDEFRRHKTVGSDEAQRFLQEWEVYATALLQQANENRQ
NSTGKACFGTFLPEEKLNDFRDEQIGQLQELMQEATKPNRQFSI 
SESMKPKF 


6632 


1273 


588


1 - tAi-AAAMKAWORVTRASVTVGGEQISAIGR 
GICVLLGISLEDTQKELEHMVRKILIILRVFEDESGKHWSKSVMD 
KQYEILCVSQFTLQCVLKGNXPDFHLAMPTEQAEGFYNSFLEQL 
RKTYRPELIKDGKFGAYMQVHIQNDGPVTIELESPAPGTATSDP 
KQLSKLEKQQQRKEKTRAKGPSES3KERNTPRKEDRSASSGAEG 
DVSSEREP 


6633 


1145 


617 


ATGRHEGVPTLEGIIQQLWGIITPATIPSLGPWGVLHSNPMDY 
AWGANGLDAIITQLLNQFENTGPPPADKEKIQALPTVPVTEEHV 
GSGLECPVCKDDYALGERVRQLPCNHLFHDGCIVPWLEQHDSCP 
VCRKSLTGQNTATNPPGLTGVSFSSSSSSSSSSSPSNENATSNS 


6634 


1 




CGGIPRKGSGPRRRLPMARLRDCLPRLMLTLRSLLFWSLVYCYC 
GLCASIHLLKLLWSL3KGPAQTFRRPAREHPPACLSDPSLGTHC 
YVRIKDSGLRFHYVAAGERGKPLMLLLHGFPEFWYSWRYQLREF 
KSEYRWALDLRGYGETDAPIHRQKYKLDCLITDIKDILDSLGY 
SKCVLIGHDKGGMIAWLIAICYPEMVMKLIVINFPHPWVFTEYI 
LRHPAQLLKSSYYYFFQIPWFPEFMFSINDFKVLKHLFTSHSTG 
IGRKGCQLTTEDLEAYIYVFSQPGALSGPINHYRNIFSCLPLKH 
HrWTTPTLLLWGENDAFMEVEMAEVTRFYVKNYFRLTILSEASH 
WLQQDQPDIVNKLIWTFLKEETRKKD 


6635


1420 


470 


EMRAGQQLASMLRWTRAWRLPREGLGPHGPSFARVPVAPSSSSG 
GRGGAEPRPLPLSYRLLDGEAALPAWFLHGLFGSKTNFNSIAK 
IIAQQTGRRVLTVDARNHGDSPHSPDMSYEIMSQDLQDLLPQLG 
LVPCVWGHSMGGKTAMLLALQRPELVERLIAVDISPVESTGVS 
HFAT Y VAAMRAINI ADEL PRSRARKLADEQLS S VI QDMAVRQHL 
LTNLVEVDGRFVWRVNLDALTQHLDKILAFPQRQESYLGPTLFL 
LGGNSQFVHPSHHPEIMRLFPRAQMQTVPNAGHWIHADRPQDFI 


6636


1514 


1801 


S FCM F S HKQDSHFQAVPVQEKKKRLRRAPWRAFAQ P QRLKHPAE 
QPIVRQCLQRPPLCGVLGPVQQQLPPSLGPVLSPHSDPGWCRVD 
DGGDGVF 


6637 


2 


1501 


CSSSPCFHDGTCVLDKAGSYKCACLAGYTGQRCENLLEAGKSKI 
KASEDSLSVLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGT 
WSFFCNNSYVLSGN3KRTCQQNGEWSGKQPICIKACREPKISD 
LVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTKKPALPFG 
DLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPS 
CIPICGKIENITAPKTQGLRWPWQAAIYRRTSGVHDGSLHKGAW 
FLVCSGALVNERTW/AAHCVTDLGKVTMIKTADLKWLGKFYR 
DDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARIS 
TRVQPICLAASRDLSTSFQESHITVAGWNVIADVRSPGFKNDTL 
RSGWSWDSLLCEEQHEDHGIPVSVTD1IMFCASWEPTAPSDIC 
TAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAF 
TKVIjPFKDWIERNMK 








PNSDIDLSNLERLEKYRSFDRYRRRAEQEAQAPHWWRTYREYFG 
EKTDPKEKIDIGLPPPKVSRTQQLLERKQAIQELRANVEEEPAA 
RLRTASVPLDAVRAEWERTCGPYHKQRLAEYYGLYRDLFHGATF 
VPRVPLHVAYAVGEDDLMPVYCGNEVTPTEAAQAPEVTYEAEEG 
SLWTLLLTSLDGHLLEPDAEYLHWLLTNIPGNRVAEGQVTCPYL 
PPFPARGSGIHRLAFLLFKQDQPIDFSEDARPSPCYQLAQRTFR 
TFDFYKKHQETMTPAGLSFFQCRWDDSVTYIFHQLLDMREPVFE 
FVRPPPYHPKQKRFPHRQPLRYLDRYRDSHEPTYGIY 


6639


2046 


12S8 


IGCFIMDGGDDGNLIIKKRFVSEAELDBRRKRRQEEWEKVRKPE 
DPEECPEEVYDPRSLYERLQEQKDRKQQEYEEQFKFKNMVRGLD 
EDETNFLDEVSRQQELIEKQRREEELKELKEYRNNLKKVGISQE 
NKKE VEKKLTVKP I ETKNKFSQAKLIAGAVKHKS 3 ESGNSVKRL 
KPDPEPDDKNQEPSSCKSLGNTSLSGPSIHCPSAAVCIGILPGL 
GAYoGSSDSES So JSEGT JNATGKIVSGIl'RTNT L''1jEA? 


6640 


117 


1043 


VLEPPDVSMAESEDRSLRIVLVGIi T 5 IKS TAMTTLGEEIFDS 
RIAAQAVTKNCQKASREWQGRDLLVVDTPGLFDTKESLDTTCKE 
ISRCIISSCPGPHAIVLVLLLGRYTEEEQKTVALIKAVFGKSAM 
KHMVILFTRKEELEGQSFHDFIADADVGLKSIVKECGNRCCAFS 
NSKKTSKAEKESQVQELVELIEKMVQCNEGAYFSDDIYKDTEER 
LKQREEVLRKIYTDQLNEEIKLVEEDKHKSEEKKEKEIKLLKLK 
i 'Li T IRE ^-NIFKDVFNRIWKMLSEIWHRFLSKCKFYSS 


6641 




894 


SAAVGRRSEVRGCAPRPRLRRSARRMDPVPGTDSAPLAGLAWSS 
ASAPPPRGFSAISCTVEGAPASFGKSFAQKSGYFLCLSSLGSLE 
NPQENWAD I Q I WDKS PLPLGFS PVCDPMDSKAS VSKKKRMCV 
KLLPLGATDTAVFDVELSGKTKTVPGYLRIGDMGGFAIWCKKAK 
APRPVPKPRGLSRDMQGLSLDAASQPSKGGLLERTASRLGSRAS 
TLRRNDSIYEASSLYGISAMDGVPFTLHPRFEGKSCSPLAFSAF 
GDLTIKSIADIEEEYNYGFWEKTAAARLPPSVS 


6642


22 


129S 


PLEERMMTKMDPNDQAQRDIIFELRRIAFDAESDPSNAPGSGTE 

HQDTYIRIVLENESREDKHECPFGRSAIELTKMLCEILQVGELP 
NEGRNDYIIPMFFTHDRAFEELFGICIQLLNKTWKEMRATAEDFN 
KVMQWREQITRALPSKPNSLDQFKSKLRSLSYSEILRLRQSER 
MSQDDFQSPPIVELREKIQPEILELIKQQRLNRLCEGSSFRKIG 
NRRRQERFWYCRLALWHKVLHYGDLDDWPQGEVTFESLQEKIPV 
ADIKAIVTGKDCPHMKEKSALKQNKEVLELAFSILYDPDETLWF 
IAPNKYEYCIWIDGLSALLGKDMSSELTKBDLDTLLSMEMKLRL 
LDLENIQIPEAPPPIPKEPSSYDFVYHYG 


6643 


3049 


2265 


SLHAPA2GRTRGRLAEKPKMLTRKIKLWDINAHITCRLCSGYLI 
DATTVTECLHTFCRSCLVKYLEENNTCPTCRIVIHQSHPLQYIG 
HDRTMQDIVYKLVPGLQEAEMRKQREFYHKLGMEVPGDIKGETC 
SAKQHLDSHRNGETKADDSSNKEAAEEKPEEDNDYHRSDEQVSI 
CLECNSSKLRGLKRKWIRCSAQATVLHjKKFIAKKLKTLSSFNEL 
DILCNEEILGKDHTLKFWVTRWRFKKAPLLLHYRPKMBLL 


6644 


1489 


290 


FRPLATEPRGSSPVQLVSSTMSVRTLPLLFLNLGGEMLYILDQR 
LRAQNI PGDKARKVLNDI I STMFNRKFMEELFKPQELYSKKALR 
TVYERLAHASIMKLNQASMDKLYDLMTMAFKYQVLLCPRPKDVL 
LVTFNHLDTI KGF IRDS PTI LQQVDET LRQLTE I YGGLSAGEFQ 
LIRQTLLIFFQDLHIRVSMFLKDKVQNNN3RFVLPVSGPVPWGT 
EVPGLIRMFNNKGEEVKRIEFKHGGNYVPAPKEGSFEFYGDRVL 
KLGTNMYSVNQPVETHVSGSSKNLASWTQESIAPNPLAKEELNF 
LARLMGGMEIKKPSGPEPGFRLNLFTTDEEEEQAALTRPEELSY 
EVINIQATQDQQRSEELARIMGEFEITEQPRLSTSKGDDLIAMM 
DEL 

GQRSTPLI1AARNGHAKWRLLLEHYRVQTQQTGTVRFDGYVID 
GATALWCAAGAGHFEWKLLVSHGANVNHTTVTNSTPLRAACFD 
GRLDIVKYLVENNANI S IANKYDNTCLMIAAYKGHTDWRYLLE 
QRADPNAKAHCGATALHFAAEAGHIDIVKEL1KWRAA1WNGHG 
MTPLKUAAES CKADWELLLSHADCDRRS R IEALELLGA3 FAND 
RENYDIIKTYHYLYLAMLERFQDGDNILEKEVLPP1HAYGNRTE 
CRNPQELES IRQDRDALHMEGLI VRERILGADNIDVSHP I IYRG 
AVYADNMEFEQCIKLWLHALHLRQKGNRNTHKDLLRFAQVFSQM 
IHLNETVKAPDIECVLRCSVLEIEQSMNRVKNISDADVHNAMDN 
YECNLYTFLYLVCISTKTQCSEEDQCKINKQIYNLIHLDPRTRE 
GFTLLH1AVHSNTPVDDFHTNDVCSFPNALVTKLLLDCGAEWA 
VDNEGNSALHI I VQYNRP I SDFLTLHS I I ISLVEAGAHTDMTNK 
QNKTPLDKSTTGVSEILLKTQMKMSLKCLAARAVRANDINYQDQ 
I PRTLEEFVGFH 


6646 


176 


890 


PSSRMNHLPEDMENALTGSQSSHASLRNIHSiWPTQLMARIESY 
EGREKKGISDVRRTFCLFVTFDLLFVTLLWIIELNVNGGIENTL
EKEVMQYDYYSSYFDIFLLAVFRFKVLILAYAVCRLRHWWAIAL 
TTAVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLD 
FKVLPQEAEEENRLLIVQDASERAALIPGGLSDGQFYSPPESEA 
GSEEAEEKQDSEKPLLEL 


6647 






E 1 Rl iIHLPEDMENALTGSQSSHAS^RNIHSINPTQLMARIESY 
EGREKKGISDVRRTFCLFVTFDLLFVTLLWIIELNVNGGIENTL 
EKEVMQYDYYSSYFDIFLLAVFRFKVLILAYAVCRLRHWWAIAL 
TTAVTSAFLLAKVILSKLFSQGAFGYVLFIISFILAWIETWFLD 
FKVLPQEAEEENRLLIVQDASERAALIPGGLSDGQFYSPPESEA 
GSEEAEEKQDSEKPLLEL 


6648 


413 


837 


RNCWNCFTKYFNSPPEDIDHKDSYLITRSIMAEPDYIEDDNPEL 
IRPQKLINPVKTSRNHQDLHRELLMWQKRGLAPQNKPELQKVME 
KRKRDQVIKQKEEEAQKKKSDLEIELLKRQQKLEQLELEKQKLQ 
EEQENAPEFVKVKGNLRRTGQEVAQAQES 


6649 


1357 


832 


WIPRAAGIRHEVKWDVKEIMSQHNIYVDALLKEFEQFNRRLNEV 
SKRVRIPLPVSNILWEHCIRLANRTIVEGYANVKKCSNEGRALM 
QLDFQQFLMKLEKLTDIRPIPDKEFVETYIKAYYLTENDMERWI 
KEHREYSTKQLTNLVWVCLGSHINKKARQKLLAAIDDID2PKR 


6650


32 


765 


LVPLVFSLLVQSCKQVYRSIAMKFVPCLLLVTLSCLGTLGQAPR 
QKQGSTGEEFHFQTGGRDSCTMRPSSLGQGAGEVWLRVDCRNTD 
QTYWCEYRGQPSMCQAFAADPKSYWNQALQELRRLHHACQGAPV 
LRPSVCREAGPQAHMQQVTSSLKGSPEPNQQPEAGTPSLRPKAT 
VKLTEATQLGKDSMEELGKAKPTTRPTAKPTQPGPRPC-GNEEAK 
KKAWEHCWKP FQALCAFL ISFFRG 


6651 


3425 


1353 


AKELLKVGDFSLC^GPYQNTADTMEIILSKEPIjASFVSESFDISA 
CGIATEHVKIDNSGEGLTAEAGSETLSRDGEVGVNSDMHYELSG 
DSDLDLLGDCRNPRLDLEDSYTLRGSYTRKKDVPTDGYESSLNF 
HtCvNQEDWGCSSWVPGMETSLPPGHVITAAVKKEEKCVPPYVQIR 
DLHGILRTYANFSITKELKDTMRTSHGLRRHPSFSANCGLPSSW 
TSTWQVADDLTQNTLDLEYLRFAHKLKQTIKNGDSQHSASSANV 
FPKESPTQISIGAFPSTKISEAPFLHPAPRSRSPLLVTWESDP 
RPQGQPRRGYTASSLDSSSSWRERCSHNRDLRNSQRNHTVSFHL 
NKLKYNSTVKESRNDISLILNEYAEFNKVMKNSNQFIFQDKELN 
DVSGE ATAQE MYLPFPGRSASYEDIII DVCTNLHVKLRS WKEA 
CKSTFLF YLVETEDKSFFVRTKNLLRKGGHTE I E PQHFCQAFHR 
ENDTLIIIIRKEDISSHLHQIPSLLKLKHFPSVIFAGVDSPGDV 
LDHTYQELFRAGGFVISDDKILEAVTLVQLKEIIKILEKLNGNG 
RWKWLLHYRENKKLKEDERVDSTAHKKNIMLKSFQSANIIELLH 
YHQCDSRSSTKAEILKCLLNLQIQHIDARFAVLLTDXPTIPREV 
FENNGILVTDVNNFIENIEKIAAPFRSSYW 


6652 


2 


1343 


IPGSTISCSCHSRRLRGGSPAPRLSLGAASPRPRPPSLPLPLPL 
PFPLFLPTRPAERAWIRSRRASEWVGKMEVPRLDHALNSPTSPC 
EEVIKNLSLEAIQLCDRDGNKSQDSGIAEMEELPVPHNIKISNI 
TCDSFKISWEMDS KSKDRI THYF IDLNKKENKNSNKFKHKDVPT 
KLVAKAVPLPMTVRGHWFLS PRTEYTVAVQTASKQVDGDYWSE 
WSEIIEFCTADYSKVHLTQLLEKAEVIAGRMLKFSVFYRNQHKE 
YFDYWEHHGNAMQPSVKDNSGSHGSPISGKLEGIFFSCSTEFN 
TGKPPQDS P YGRYRFE IAAEKLFNPNTNLYFGDFYCMYTAYHYV 
ILVIAPVGSPGDEFCKQRLPQLNSKDNKPLTCTEEDGVLVYHHA 
QDVILEVIYTDPVDLSLGTVAEITGHOLMSLSTANAKKDPSCKT 


6653


170 


1910 


FFLEPRLRPFPASRARFVPARTRPSPLHPCCFCFEGGGSMLSPQ 
RVAAAASRGADDAMESSKPGPVQWLV0KDQH3FELDEKAIASI 
LLQDHIRDLDWWSVAGAFRKGKSFILDFMLRYLYSQKESGHS 
NW1.GDPEEPLTGFSWRGGSDPETTGIQIWSEVFTVEKPGGKKVA 
WLMDTQGAFDSQSTVKDCATIFALSTMTSSVQIYNLSQNIQED 
DLQQLQLFTEYGRLAMDEIFQKPFQTLMFLVRDWSFPYEYSYGL 
QGGMAFLUKRLQVKEHQHEEIQNVRNHIIISCFSDVTCFLLPHPG 
LQVATSPDFDGKLKD IAGEFKEQLQALIPYVLNPSKLMEKEING 
SKVTCRGLLEYFKAYIKI YQGEDLPHPKSMLQATAEAYNLAAAA 
SAKDIYYNNMEEVCGGEKPYLSPDILEEKHCEFKQLALDHFKKT 
KXMGGKDFSFRYQQKLEE3IKELYENFCKHNGSKNVFSTFRTPA 
VLFTGIVALYIASGLTGFIGLEWAQLFNCMVGLLLIALLTWGY 
IRYSGQYRELGGAIDFGAAYVLEQASSHIGNSTQATVRDAWGR 
PSMDKKAQ 


6654


1 


705 


RTSLSPSQCSSFNLAMASAGMQILGVVLTLLGWVNGLVSCALPM 
WKVTAFIGNSIWAQWWEGLWMSCWQSTGQMQCKVYDSLLAL 
PQDLQAARALCVIALLVALFGLLVYLASAKCTTCVEEKDSKARL 
VLTSGI VFVI SGVLTLI PVCWTAHAVIRDFYNPLVAEAQKR3LG 
ASLYLGWAASGLLIjIjGGGLLCCTCPSGGSQGPSHYMARYSTSAP 
AISRGPSEYPTKNYV 


6655


341 


15


KDAYMFKKGLLAIALVFSLPVFAAEIIWIDVRVPEQYQQEHVQGA 
INI PLKEVKERIATAVPDKNDTVKVYCNAGRQSGQAKE ILSEMG 
YTEVENAGGLKDIAMPKVKG 


6656




1212 


TELPPRPANLAIQPPLSPLRALAPLPEKPGAVP.PPQICRMAKVAK 
DLNPGVKKMSLGQLQSARGVACLGCKGTCSGFEPHSWRKICKSC 
KCSQEDHCLTSDLEDDRKiGRLLMDSKYSTLTARVKGGDGIRIY 
KRNRMIMTNPIATGKDPTFDTITYEWAPPGVTQKLGLQYMELIP 
KEKQPVTGTEGAFYRRRQLMHQLPIYDQDPSRCRGLLENELKLM 
EEFVKQYKSEALGVGEVALPGQGGLPKEE3KQQEKPEGAETTAA 
TTNGSLSDPSKEVEYVCELCKGAAPPDSPWYSDRAGYNKQWHP 
TCFVCAKCSE PLVDLI YFWKDGAPWCGRHYCES LRPRCSGCDE I 
I FAEDYQRVEDLAWHRKHFVCEGCEQLLSGRAY I VTKGQLLCPT 


6657 




2120 


LLTCQERAGDCLLSASTMKEWYWSPKKV.ADWLLENAMPEYCE? 
LEHFTGQDLINLTQEDFKKPPLCRVSSDNGQRLLDMIETLKMEH 
HLEAHKNGHANGHLNIGVD I PTPDGS FS I KIKPNGMPNGYRKEM 
I KI PMPELERSQYPMEWGKTFLAFLYALS CFVLTTVMIS WHER 
VPPKEVQPPLPDTFFDHFNHVQWAFS I CEINGMILVGLWLIQWL 
LLKYKSIISRRFFCIVGTLYLYRCITMYVTTLPVPGMHFNCSPK 
LjGDKSAQLRRIMKLIAGGGLSITGSHNMCGDYLYSGHTVMLTL 
TYLFIKEYS PRRLWWYHWICWLLSWGIFCILLAHDHYTVDWV 
AYYITTRLFWWYHTMANQQVLKEASQMNLIARVWWYRPFQYFEK 
NVQG IVPRS YHW P FP W PWHLS RQVKYS RLVNDT 


6658


35 




HCCALGAPGSPYRGLYFSSAAPCTAPRKAKHQSTLEGLTKRMLM 






WO 01/53312 



PCT/US00/34263



FDPVPVKQEAMDPVSVSYPSNYMESMKPNKYGVIYSTPLPEKFF 
QTPEGLSHGIQMEPVDLTVNKRSSPPSAGNS PSSLKFPSSHRRA 
SPGLSMPSSSPPIKKYSPPSPGVQPFGVPLSMPPVMAAALSRHG 
IRSPGILPVIQPVWQPVPFMYTSHLQQPLMVSLSEEMENSSSS 
MQVPVIESYEKPISQKKIKIEPGIEPQRTDYYPEEMSPPLMNSV 
SPPQALLQE 


6659 


18 


523 


EPQRGDCETWFQNCSLPKFVCFFCWGFWLWRAHSMSNLHSLPGL 
CGNPTI TCPHNRTLNNCHHSGVQVPLMYCKLTTPSPQWI SNCRY 


6660 


514 


1707 


CAASLDCRHHLCEPDMKLVWPSAKLLQAAAGASARACDSVTSNV 
LPLLLEQFHKHSQSSQRRTILEMLLGFLKLQQKKSYEDKDQRPL 
NGFKDQLCSLVFMALTDPSTQLQLVGIRTLTVLGAQPDLLSYED 
LELAVGHLYRLSFLKEDSQSCRVAALEASGTLAALYPVAFSSHL 
VPKLAEELRVGESNLTNGDEPTQCSRKLCCLQALSAVSTHPSIV 
KETLPLLLQHLWQVNRGNMVAQS 3D 1' i 1 " , L r 
ESCWYFHQTAIPCLLALAVQASMPEKEPSVLRKVLLEDEVLAAM 
VSVIGTATTHLSPELAAQSVTHIVPLFLDCiNVSFLPENSFPSPF 
QPFQDGSSGQRRLIALLMAFVCSLPRNVSEHIWEVLLFNLDKVT 
PG 


6661


179 




GVHAASGTLSATWLAEAKMFDSLAICAGKYLGQAAKLMIGMPDYD 
NYVEHKRWHPDQTPMTYEEFFRERQDARYGGKGGARCC 


6662


185 


423 


R S L P KPAPAQ PAS 1 HCARFSG VT P PTAKTAMS DGNTAFNALM Y C 
GPKADDGNIFSACAPASSAVKASVSVAQPGQAVIP 








PKENLFSFQTASTTMQAISNFRKHLRMVGSRRVKAQTFAERRER 
SFSRSWSDPTPMKADTSHDSRDSSDLQSSHCTLDEAFEDLDWDT 
EKGLEAVACDTEGFVPPKVML1SSKVPKAEYIPTIIRRDDPSII 
PILYDHEHATFEDILEEIERKLNVYHKGAKIWXMLIFCQGGPGH 
LYLLKNKVATFAKVEKEE3MIHFWKRLSRLMSKVNPEPOTIHIM 
GCYILGNPNGEKLFQNLRTLMTPYRVTFESPLELSAQGKQMIET 
YFDFRLYRLWKSRQHSKLLDFDDVL 




58 


968 


PRLLRLPRSWVMDSPWDELALAFSRTSMFPFFDIAIIYLVSVMA 

LANHTNILLASSIWYITFFCPHDLVSQGYSYLPVQLLASGMKEV 
TRTWKIVGGVTHANSYYKNGWIVMIAIGWARGAGGTIITNFERL 
VKGDWKPEGDEWLKMSYPAKVTLLGSVIFTFQHTQHLAISKHNL 
1 1 i 1 J KMTTQTSTMTFAPFEDTLSWMLFGWQQPFS 
5CEKKSEAKSPSNGVGSLASKPVDVASDNVKKKHTKKNE 


6665


171 


1278


DERRLACRQWTQQRSELYPGFQKRQRFLPKAGEEAAAQGGRHL 
PGRWLGPGCTQNPCSVHTATGPEPRKLPLLPPDSPNSGYPKEPA 
ALCPGIPSPCRMTHQDLSITAKLINGGVAGLVGVTCVFPIDLAK 
TRLQNQHGKAMYKGMIDCLMKTARAEGFFGMYRGAATOLTLVTP 
EKAIKLAANDFFRRLLMEDGMQRNLKMEMIAGCGAGMCQVVVTC 
PMEMLKIQLQDAGRLAVHHQGSASAPSTSRSYTTGSASTHRRPS 
ATLIAWELLRTQGLAGLYRGLGATLLRDIPFS 1 1 YFPLFANLNN 
LGFNELAGKASFAHSFVSGCVAGSIAAVAVTPLDVLKTRIQTLK 
KGLGEDMYSGITDCAR 


6666


498 


2868 


KTTFLPVPQMMAGFSFGTFGNPPMESPSAWQTIHQPFIVSCLTL 
WSPGCWPQPIQKEGVGLWDIRKPQSSLLRYGGNLSLQSAMSVRF 
NSNGTQLLALRRRLPPVLYDIHSRLPVFQFDNQVYFMSCTMKSC 
CFAGDRDQYILSGSDDFNLYMWRIPADPEAGGIGRWNGAFMVL 
KGHRSIVNQVRFNPHTYMICSSGVEKIIKIWSPYKQPGCTGDLD 
GRIEDDSRCLYTHEEYISLVLNSGSGLSHDYANQSVQEDPRMMA 
FFDSLVRREIEGWSSDSDSDLSESTILQLHAGVSERSGYTDSES 
SASLPRSPPPTVDESADNAFHLGPLRVTTTNTVASTPPTPTCED 
AASRQQRLSALRRYQDKRLLALSNESDSEENVCEVELDTDLFPR 
PRSPSPEDESSSSSSSSSSEDEEELNERFASTWQRNAMRRRQKT 
TREDXPSAPIKPTNTYIGEDNYDYPQIKVDDLSSSPTSSPERST 
STLEIQPSRASPTSDIESVKRKTYKAYKWLRYSYISYSNNKDGE 
TSLVTGEADEGRAGTSHKDNPAPSS3KEACLNIAMAQRNQDLPP 
E3CSXDTFKEETPRTPSNGPGHEHSSHAWAEVPEGTSQDTGNSG 
SVEHPFETKKLNGKALSSRAEEPPSPPVPKASGSTLNSGSGNCP 
RTQSDDSEERSLETICANHNNGRLHPRPPHPHNNGQNLGELEW 
AYSSPGHSDTDRDNSSL7GTLLHKDCCGSEMAC3TPNAGTREDP 
TDTPATDSSRAVHGHSGLKRQRIELEDTDSENSSSEKKLKT 


6667 


171 


1310 


AEEVERIAAMRSDSLVPGTHTPPIRRRSKFANLGRIFKPWKWRK 
KKSEKFKHTSAALERKISMRQSREELIKRGVLKEIYDKDGELSI 
SNEEDSLENGQSLSSSQLSLPALSEMEPVPMPRDPCSYEVLQPS 
DIMDGPDPGAPVKLPCLPVKLSPPLPPKKVMICMPVGGPDLSLV 
SYTAQKSGQQGVAQHHHTVLPSQIQHQliQYGSHGQHLPSTTHSL 
PKIIPSGCRMIDET.NKTLAMTMQRLESSEQRVPCSTSYHSSGLHfc 
GDGVTKAGPMGLPEIRQVPTWIECDDNKENVPHESDYEDSSCL 
YTREEEEEEEDEDDDSSLYTSSLAMKVCRKDSLAIKPSNRP3KR 
ELEEKNILPRQTDEERLELRQQIGTKL 


6668 


714 




L fi ATGPALTLRCHVCTSSSNCKHSVVCPASSRFCKTTNTVEP 
LRGNLVKKDCAESCTPSYTLQGQVSSGTSSTQCCQEDLCNEKLH 
NAAPTRT LAI LfLGLALSLLAVILAPSL 


6669


459 


1207 


KDEETRKDYDYMLDHPEEYYSHYYHYYSRRLAPKVDVRWILVS 
VCAISVFQFFSWWNSYNKAISYLATVPKYRIQATEIAKQQG1.LK 
KAKEKGKNKKSKEEIRDEEENIIKNI1KSKIDIKGGYQKPQICD 
LLLFQIILAPFHLCRYTWYCRWIYNFNIKGKEYGEEERLYIIR 
KSMKMSKSQFDSLEDHQKETFLKRELWIKENYEVYKQEQEEELK 
KKLANDPRWKRYRRWMKNEGPGRLTFVDD 


6670 


184 


594 


VARI*cEAAKMSSEPPPPYPGGPTAPLIjEEKSGAPPTPGRSSPA 
VMQPPPGMPLPPADIGPPPYEPP3HPMPQPGFIPPHMSADGTYM 
PPGFYPPPGPHPPMGYYPPGPYTPGPYPGPGGHTATVLVPSGAA 
TTVTV 


6671 


1 


763 


LPAEKPRSAPNMAGGRCGPQLTALLAAWIAAVAATAGPEEAALP 
PEQSRVQPMTASNWTLVMEGEWMLKFYAPWCPSCQQTDSEWEAF 
AKN-GEILQISVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGIFRR 
ITEDLaNYILEKKWQSVEPLTGWKSPASLTMSGMAGLFS 
ISGKIWHLHNYFTVTLGIPAWCSYVFFVIATLVFGLSMDLVL*V 
I SQCNWDPP YRHVS * /RPSTNLGVHTAHTSEHLRL 


6672


304


1089 


APGSKPVQFMDFEGKTSF3MSVFNLSNA_MG3GILGLAYAMAHT 
GVIFFLALLLCIALLSSYSIHLLLTCAGIAGIRAYEQLGQRAFG 
PAGKWVATVI CLHNVGAMS S YLF I IKSELPLVIGTFLYMDPEG 
DWFLKGNLLIIIVSVLIILPLALMKHLGYLGYTSGLSLTCMLFF 
LVSVIYKKFQLGLCYRATMKQQWESEALVGTPQPRDSTAAVKAQ 
MFHS * L TG VLTQWP I MAFAF VCHPGGAGP S I TELCRAFQAQD 


6673


1116 


1963 


rHLGSHQLLANAGTMLCRQQSSSMAPAFSQ 
SVTCGPSPCVRKQESATKCLHIGACGSDLWARGWEQG*G*GLNV 
WLCPCVAFHRGARPQAEEGGARWNSLVSSPWIPPNP+HSSIGAE 
NAVPRP*QG*KVNPSGQERQS\WVLPLPVPGEPi,KLPGLPG*NK 
SFSRV/SGSKGKWILPRQLM*AS*R\TPRFVPGTQWVPITW/PL 
ITWH*SAPTPPLKACPAPRESDPCSSCLSCPCVTQHPRFSDTGW 
FGAGHCHSSCDFTRKGAAGGPG 


6674 


1 


440 


LE FD YMCQYDYVE VRDGDNRDGQI I KRVCGNERPAP I QS IGSSL 
HVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSY 
KCACLAGYTGQRCENLLEERNCSDPG/WPSQWVPENNRGPWAYQ 
PTPC* IGTRVAFFLT 


6675 


277 


1678 


GKVIPTERMAFLDNPTIILAHIRQSHVTSDDTGMCEMVLIDHDVD 
LEKIHPPSMPGDSGSEIQGSNGETQGYVYAQSVDITSSWDFGIR 
HRSNTAQRLERLRKERQNQIKCKNIQWKERNSKQSAQELKSLPE 
KKSLKEKPPISGKQSILSVRLEQCPLQLNNPFNEYSKFDGKGHV 
GTTATKKIDVYLPLHSSQDRLLPMTWTMASARVQDLIGLICWQ 
YTSEGREPKLNDNVSAYCLHIAEDDGEVDTDFPPLDSNEPIHKF 
GFSTLALVEKYSSPGLTSKESLFVRINAAHGFSLIQVDNTKVTM 
KE ILLKAVKRRKGSCKVSGSRADGVFEEDSQ I D IATVQDMLS SH 
HYKSFKVSMIHRLRFTTDVQL/GCALFPGVLRKRAAPVDCLRPS 
ADTWRQEQIGCCGAACAALRS*DSHKC*EGISGDKVEIDPVTNC 
KASTKFWIKQKPISIDSDLLCAC\DLAEE 


6676 


277 


«,. 


GNWPTERMAFLDNPTIILAHIRQSHVTSDDTGMCEMVLIDHDVB 
LEKIHPPSMPGDSGSEIQGSNGETQGYVYAQSVDITSSWDFGIR 
RRSNTAQRLERLRKERQNQIKCKNIQWKERNSKQSAQELKSLPE
KKSLKEKPPISGKQSILSVRLEQCPLQLNNPFNEYSKFDGKGHV 
GTTATKKIDVYLPLHSSQDRLLPMTWTMASARVQDLIGLICWQ 
YTSEGREPKLNDNVSAYCLHIAEDDGEVDTDFPPLDSNEPIHKF 
GFSTLALVEKYSSPGLTSKESLFVRINAAHGFSLIQVDNTKVTM 
KEILLKAVKRRKGSQKVSGSRADGVFEEDSQIDIATVQDMLSSH 
HYKSFKVSMIHRLRFTTDVQL/GCALFPGVLRKRAAPVDCLRPS 
ADTWRQEQIGCCGAACAALRS*DSHKC*EGISGDKVEIDPVTNQ 
1 1l t I 1 E 


6677 






GKWPTSRMAFLDNPTIILAHIRQSHVTSDDTGMCEMVLlDn^VD 
LEKIHPPSMPGDSGSEIQGSNGETQGYVYAQSVDITSSWDFGIR 
RRSNTAQKLL. L J .. ERQ 1Q 1 1 < KN XQWKEHNSKQSAUELKSLFE 
KKSLKEKPPISGKQSILSVRLEQCPLQLNNPFNEYSKFDGKGHV 
GTTATKKIDVYLPLHSSQDRLLPMTWTMASARVQDLIGLICWQ 
YTSEGREPKLNDNVSAYCLHIAEDDGEVDTDFPPLDSNEPIHKF 
GFSTLALVEKYSSPGLTSKESLFVRINAAIIGFSLIQVDNTKVTM 
KEILLKAVKRRKGSQKVSGSRADGVFEEDSQIDIATVQDMLSSH 
HYKSFKVSMIHRLRFTTDVQL/GCALFPGVLRKRAAPVDCLRPS 
ADTWRQEQIGCCGAACAALRS*ESHKC*EGISGDKVEIDPVTNQ 
KASTKFWIKQKPISIDSDLLCAC\DLAEE 


6678 


221 




GPSKQSSGSLSLIVTGCSSYWS*INDTCTILRVLSSNFGRQ*LR 
PFPCSQLPMSQGCLWHLDCCCPWVPYIPGQQWRKGRQRMRN*QS 
LLGSDQESVGLEDLCVFWFLLHVLLGLFP*PHELFLLPWDLG 
FLFPLLLQGGCHCLVLPANLVSCAPQIGKLSCRLQTHDLEGSRN 
HHPLFLWGRWDAVKHLETVQSGLASLGFVGQHTSHGPP 


6679 


2 


786 


LEFARGAMPFLGQDWRSPGQNWVKTVDGWKRFLDEKSGSFVSDL 
SSYCWKEVYNKENLFNSLNYD/SCSQEEKEGHAE*QNQNS\DFH 
QEKWIYVHKGSTKERHGYCTLGEAFKRLDFSTA-LDSRRFKYW 
RLLELIAKSQLTSLSGIAQKNFMNILEKW1KVLEDQQNITLIR 
ELLQTLYTSLCTLVKRVGKSVLVGNINMMVYRMETILHWQQQLW 
NIQITRVSGQAQPPPGSGSLHRDTGQTRQDFEFTPVTEESGLF 


6680 


1498 


2951 


PLCTLPLMPSALPGWAGERWEKQWPLA/PGPGTWQTPVGSISEE 
P\RKNEPDTHCPRGEARPEV*HLPKPHSPGSEGAEIQTSA*ALP 
/NQVSPPQPK*GAEENGDQRGGKEEAGEELHRSSSGLTAAPGFP 
EVHRNLQTFPGLPSRGGGP/GGAGTQGSWAPGEQPP/SPLLPAS
MQRSQAGLPGWEAGLVESPTHHIPALRPSGTNATGEAFPSTTCS 
SGP\PAPPGPTGLRP3GGSSSGGHG**PGLPVGKV\GALGAAQD 
PQSQGRGPTQGTVGTEMLLSGLGSAKACPAARPAVP*LPSDPAS 
TIPKKGTRGFGEGPGVLQERNRWWGRAQGFTSADAAGTAPPGV 
*LPAPLSQPPGATEPQVRACGMAPPSPGTSGRLVAWGRHPGPQV 
AQGCPPGAGCWGSQPRGSQRCPRTYTHSPLGHGRAPCPRRCWH* 
WQDPPSSPRTGCLPGIPARQAYSAPRTRSRPGIRTGRAAYGFIR 
FQGGGGG 








KMTVGVLTQTVGPWSRPGAYLSKQLDGVSKGWPPCPRALAATAL 
LAQEADELTLRQNLNRKSPHA\WTLINTKGHH*LINARLTRYQ 
TLLCENPHKTIEVSNT/LNPATLLLVTESPVKHNCLEVLDSVYS 
SRPNLRDHP*TSVDWELYVDGSGFANPCKVTLKKETSPA?VTPR 
S 


6682


109 


1238 


TVLCGAMQVSSLNEVKIYSLSCGKSLPEWLSDRKKRALQKKDVD 
VRRRIELIQDFEMPTVCTTIKVSKDGQYILATGTYKPRVRCYDT 
YQLSLKFERCLDSEWTFEILSDDYSKIVFLHNDRYIEFHSQSG 

PLQTDAAENNVCDINSVHGLFATGTIEGRVECWDPRTRNRVGLL 
D\AP*TVSQQIQR*TSLPTISALKFN\GALTMAVGTTTGQVLLY 
DLRSDKPLLVKDHQYGLPIKSVHFQDSLDLILSADSRIVKMWNK 

GPAPRWCSFLDNLTEELEENPESNE 




109 


1238 


TVLCGAMQVSSLNEVKIYSLSCGKSLPEKLSDRKKRALQKKDVD 
VRRRIELIQDFEMPTVCTTIKVSKDGQYILATGTYKPRVRCYDT 
YQLSLKFERCLDSEWTFEILSDDYSKIVFLHNDRYIEFHSQSG 
FYYKTRIPKFGRDFSYHYPSCDLYFVGASSEVYRLNLEQGRYLN 
PLQTDAAENNVCDINSVHGLFATGTIEGRVECWDPRTRNRVGLL 
D\AP*TVSQQIQR*TSLPTISALKFN\GALTMAVGTTTGQVLLY 
DLRSDKPLLVKDHQYGLPIKSVHFQDSLDLILSADSRIVKMWNK 
NSGKIFTSLEPEHDLNDVCLYPNSGMLLTANETPKMGIYYIPVL 


6684 


111 


527 


GLRGGTSRGRAGREPEFAAGVLCWAGFCQSPCPPGSRGREAPA 
PP\SGRRHA*RPA*WLGGPGGDSGGREEGGS/GELQRAMESKMG 
ELPLDINIQEPRWDQSTFLGRARHFFTVTDPRNLLLSGAQLEAS 
RNIVQNYR 








KLLGDNt El. F ^ J r T P] ) FD - DP Q 

OGVIDMKNAVIGNNKQKANLIVLGAVPRLLYLLQQETSSTELKT 

ECAWLGSLAMGTENNVKSLLDCHIIPALLQGLLSPDLKFIEAC 

LRCLRTIFTSPVTPEELLYTDATVIPHLMALLSRSRYTQEYICQ 

IFSHCCKGPDHQTILFNHGAVQNIAHLLTSLSYKVRMQALKCFS 

VLAFENPQVSMTLVNVLVDGELLPQIFVKMLQRDKPIEMQLTSA 

ETLAYL I EPDVELQRIAS ITDHLIAMLADYFKYPSSVSA1 TDI K 
RLDHDLKHAHELRQAAFKLYASLGANDEDIRKKVSLGEGKPPVL 
TASRQGVTST 


6686


310 


327 


DSVTFDDLAVDFTPKEWTLLDPTQRNLYRDVMLENYXNLATVGY 
QLFKPSLISWLEQEESRTVQRGDFQASEWKVQLKTKELALQQDV 
LGEPTSSGIQMIGSHNGGEVSDVKQCGDVSSEHSCLKTHVRTQN 
SENTFECYLYGVDFLTLHKKTSTGEQRSVFSHVWKKPSSLNPDV 
VCQKNRCTRKKKAF*LQLTLGKSFH*SIHT


6687 




915 


EAMLEAPYKKEEDEQQRKEVKKDYPSNTTSSTSNSGMETSGSST 
IGETSNRSRDRDRYRRRNSRSRSPGRQCRHRSRSWDRRHGSESR 
SRDHRREDRVHYRSPPLATGEPVDNLSPEERDARTVFCMQLAAR 
IRPRDLEDFFEAVGKVRDVRIISDRNSRRSK3IAYVEFCEIQSV 
PLAIGLTGQRLLGVP 1 IVQASQAEKNRLAAMANNLQKGNGGPMR 
IjYVGSLHFNITEDMLRGIFEPFGKV 


6688 


1025 


1 


AEVPNYPRVFHKCPDSCWRFKFQPIQLQPYILLSFSSEKPPISF 
SEPGLPR/SATARMATAAAPPNSSIDLPSDSGMGFISPAGDSLD 
LPSDGGTGFFSLAGDSSSTRLSSLAFISFSLSSVSVGSSAGTTS 
STSVGSWAAFTSSSSSSTNRDVAGLDFSTVTTSVSGSLVPSRE 
VAVICGSKGAGASGSASCSSRAGKTTEATAASSMPSGTSSFSTC 
TMSELEELFSLFSPAPLLSKLFTSSGSIAICCQDSGPSDTGRLS 
VCQLWLADSDTGKLSDCQEWTVGDSGGLTCPELSLGRM*MSLL 








SSAVIPGYSSSSDSRLNTVPTVDLLCPFQTKSST 




640 


1299 


SSSASYATSATSISDTAFSGSLKLKHGLLSALDSSSRTS*STSS 
AEDSTFRICSPSVSDTSSDSSGSKDNVLILFSKVSI*SCFSLSS 
FFSDSISFCFSSSSFCKR* FVSSKVSCNALLSSRLSNGPGGSSK 
QRNSLTARQLAMSL*ATKF*RNACNPNCLSSKKSAL*LSLNQRF 
GGSASRKPGNISFNSQKCSALSYCCNFVIKPREVSVSSENYPAF


6690


1 


442 


gtrgkmaatlgplgswqqwrrclsardgsrmlllllllgsgqgp 
qqvgagqtfeylkrehslskpyqgvgtgssslwnlmgnamvmtq 
yirltpdmqskqgalwnrvpcflrdwelqvhf:<ihgqgkknl\h 
gdglaiwytkdrmqp 








DFTWEEWQLLGAAQKDLYRDVMLENYSNLVAVGYQASKPDALFK 
LEQGEQLWTIEDGIHSGACSDIWKVDHVLERLQSESLVNRRKPC 
HHHDAFENIVHCSKSQFLLGQNHDIFDLRGKSLKSNLTLVNQSK 
GYEIKNSVEFTGNGDSFLHANHERLHTAIKFPASQKLISTKSQF 
ISPKHQKTRKLEKHHVCSECGKAFIKKSKLTDHQVMHTGEKPHR 

HQKTHTGEXPYICSECGKGFIQKGNLiVKQRlli'i'GEKPYICISrEC 
/GKGFIQKTCLIAHQRFHTER 


6692


178 


939 


W I KEGELS LWERFCAN I I XAGPMPF HI [ AFTMnGNRRYAKKCQVE 

DGLMDLARQKFSRLMEEKEKLQKHGVCIRVLGDLHLLPLDLQEL 
IAQAVQATKNYNKCFLNVCFAYTSRHEISNAVREMAWGVEQGLL 
DPSDISESLLDKCLYTNRSPHPDILIRTSGEVRLSDFLLWQTSH 


6693


178 


939 


WIKEGELSLWERFCANIIKAGPMPKHIAFIMDGNRRYAKKCQVE 
RQEGHSQGFNKLAETLRWCLNLGILEVTVYAFSIENFKRSKSEV
DGLMDLARQKFSRLMEEKEKLQKHGVCIRVLGDLHLLPLDLQEL 
IAQAVQATKNYNKCFLNVCFAYTSRHEISNAVREI4AWGVEQGLL 
DPSDISESLLDKCLYTNRSPHPDILIRTSGEVRLSDFLLWQTSH 


6694 




813 


SLLLHLAPPGAYTPSQPLSSVSTETASSVRRQAAESRQHELPVR 
EVHSLGQILPQDGLTAEAGPPEAQDPWGSPGISLPAAHIGFAAA 
LAVGPSGCHTEP\FDEVWPSLFLGDAYAARDKSKLIQLGITHW 
NAAAGKFQVDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLP 


6695


292 


813 


SLLLHLAPPGAYTPSQPLSSVBTETASSVRRQAAESRQHELPVR 

LAVGPSGCHTEP\FDEVWPSLFLGDAYAARDKSKLIQLGITHW 
NAAAGKFQVDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLP 


6696




782 


PRVRGRVGERWAFLSVPAAMSSEMEPLLLAWSYFRRRKFQLCAD 
LCTQMLEKSPYDQAAWILKARALTEMVYIDEIDVDQ3GIAEMML 
DENAIAQVPRPGTSLKLPGTNQTGGPSQAVRPITQAGRPITGFL 
RPSTQSGRPGTMEQAIRTPRTAYTARPITSSSGRFVRLGTASML 
TSPDGPFINLSRLNLTKYSQKPKLAKALIEYIFHHENDVKTALD
LAALSTEHSQYKDWWWK/DQIEKCYYRVGMYREAEKQIKSS


6697 


3 


782 


PP1FLRRLNSRALRPGSRKVMAWPASLSGQDVGSFAYLTIKDR 
IPQILTKVIDTLHRHKSEFFEKHGEEGVEAEKKAISLLSKLRNE 
LQTDKPFIPLVEKFVDTDIWNQYLEYQQSLLNESDGKSRWFYS? 
WLLV\ECYMYRRIHEAI\IQSPPIDYFDVFKESKEQNFYGSQES 
IIALCTHLQQLIRTIEDLD\ENQLKDEFFKLLQISLWGEISVDL 
SL\SGGESSSQNTNVLNSLEDLKPFILLNDMEHLWSLLSNCK 


6698


668 


754 


VG S CACAG 3CKCKECKCTSCKKSECRAFP 


6699 


325 


492 


EC-ELP / PARRVLPRAMTASAQPRGRRPGVGVGVWTSCKHPRCV 
LLGKRKGSVGAGSFQLPGGHLEFGETWEECAQRETWEEAALHLK 
NVHFASWNSFIEKENYHYVTILMKGEVDVTHDSEPKNVEPEKN 








ESKRI I YNHAFFFQESKWSGGILQ 


6700 






TQCWRS3TPGMRTHFRTQP /RLECGQGFSQQEMGHCMDTUEr'IQ — 

FPFVCPRDKPVCVNTYGSYRCRTNKKC3RGYEPNEDGTACVERT 

LLLGLCNLLGK 


6701


2 


1485 


AAAGPRTRVRRAAAFEGQPSPSPGLGPTSDKAAAPRTPKRRRLW 
RQRQ/HPAMLCYVTRPDAVLMEVEVEAKANGEDCLNQVCRRLGI 
IEVDYFGLQFTGSKGESLWLNLRNRISQQMDGIAPYRLKLRVKF 
FVEPHLILQEQTRHIFFLHIKEALLAGHLLCSPEQAVELSALLA
QTKFGDYNQNTAKYNYEELCAKELSSATLNSIVAKHKELEGTSQ 
ASAEYQVLQIVSAMENYGIEWHSVRDSEGQKLLIGVGPEGISIC 
KDDFSPINRIAYPWQMATQSGKNVYLTVTKESGNSIVLLFKMI 
STRAASGLYRAITETHAFYRCDTVTSAVMMQYSRDLKGHLASLF 
LNENINLGKKYVFDIKRTSKEVYDHARRALYNAGWDLVSRKNQ

VCCEEEINSTFCPCGHTVCCESCAAQLQVGESAAHFCLQPHLSL 
LLTGSRSQVLAR 


6702 


397 


1971 


PLAKFLKLDLVNVLCLPMEDVFLFYRTCFCSMGLGSSCHLSLPK 
RAEALLCSRKATWRDLVAVRMAEEQEFTQLCKLPAQPSHPHCV 
NNTYRSAQHSQALLRGLLALRDSGILFDWLWEGRHIEAHRIL 
LAASCDYFKGMFAGGLKEMEQEEVLIHGVSYNAMCQILHFIYTS 
ELELSLSNVQETLVAACQLQIPEIIHFCCDFLMSWVDEENILDV 
YRLAELFDLSRLTEQr.DTYII.KNFVAFSRTDKYRQLPLEKVYSL 
LSSNRLEVSCETEVYEGALLYHYSLEQVQADQISLHEPPKLLET 

SPQTELRSDFQCWGFGGIHSTPS\MSSATRPKYLNPLLGEWKH 
FTASLAPRMSNQGIAVLNNFVYLIGGDKNVQGFRAESRCWRYDP 
RHNRWFQIQSLQQEHADLSVCWGRYIYAVAGRDYHNDLNAVER 
YDPATNSWAYVAPLKREVYAHAGATLEGKMYITCGRKGRIT 


6703


45 


1244 


GVGPRAAAMPLELELCPGRWVGGQHPCFIIAEIGQNHQGDLDVA 
KRMIRMAKECGADCAKFQKSELEFKFNRKALERPYTSKHSWGKT 
YGEHKRHLEFSHDQYRELQRYAEEVGIFFTASGMDEMAVEFLHE 

TSSWDVLGRVRTSKEKVLMVLVLDYSGRPMVISSGMQSMDTMKQ 
VYQIVKPLNPNFCFLQCTSAYPLQPEDVNLRVISEYQKIjFPDIP 
IGYSGHETGIAISVAAVALGAKVLERHITLDKTWKGSDHSASLE 
PGEIiAELVRSVRLVERALGSPTKQLLPCEMACNEKLGKSWAKV 
KIPEGTILTMDMLTVKVGEPKGYPPEDIFNLVGKKVLVTVEEDD 
TIMEE 


6704 


82 


1007 


I'MNTRNRWWSGLGASPASRPTRDPQDPSGRQGELSPVEDQREG 
LEAAPKGPSRESWHAGQRRTSAYTLIAPNINRRNEIQRIAEQE 
LANLEKWKEQNRAKPVHLVPRRLGGSQSETEVRQKQQLQLMQSK 
YKQKLKREESVRIKKEAEEAELQKMKA1QREKSNKLEEKKRLQE 
NLRREAFREHQQYKTAEFL/RQTEHRIARQKCLSKCCLWPTILN 
MGQKLGLQ\DSLKAEENRKLQKMKDEOHOKSELLELKRQQQEQE 
RAKIHQTEHRRVNNAFLDRLQGKSQPGGLEQSGGCWNMWSGNSW
GI 


6705 


2 


786 


RLCRNSARVPCGWSASRSLGEGAGFIGPLRGPHPRAGGTGTSFT 
SYKRKGGIMSTIAAFYGGKSILITVATGFLGKELMEKLFRTSPD 
LKVIYILVRPKAGQTLQHRVFQILDSKLFEKVIEWPNVHEKIR 
AIYADLNQTTDFAISKEDMQELLSCTNIIFHCAATVRFDDTLRHA
VQLNVTATRQLLLMASQMPKLEAFIHISTAYSNCNLKHZDEVIY 
PCPVEPKKIIDSLEW\LDDAIIDEITPKLIRDWPNIYTYTK 






531 


FTHSSSSHSQEMLGKLNMLRNDGHFCDITIRVQBKIFRAHKVVL 
AACSDFFRTKLVGQAEDENKNVLDLHHVTVTGFIPLLEYAYTAT
LSINTENIIDVLAAASYMQMFSVASTCSEFMKSSILWNTPNSQP 
EK 


6707 


2233 


1343 


YWSGIGYELQHFHWRKFHFEKKGPPSTCQERLY3SRSRWPCIS* 
GMVWGWTAWGSW*GGQLRCVCVCTSHSSDSTRSSQRASKCHS 
FFILSQ*KT*SSWENWVFAKYSRIYSYGHSCSKGRGD*DFK*NV 
SQAR*SRFCGLCNPCGHCGLDINLRGGSSPWTDKHSCVHNNLLC 
NRRVFSLLCEGPGHCYQGAVCREACAAASPGIiDSAAEPHRLCEH 
TD*LPK*GPGYIQHFHCDSNILCILYNISFNLFSYSF*GVARYA 
C*RCHWYFEWLLYNKCCDILVACL*RRQL*SSQ 


6708


US , 


1729 


TVGSWSRSGRSPPVGRQLLLTGRGAQAAGSPQGGMALQVEbVPT 
GEIIRWHPHRPCKLALGSDGVRVTMESALTARDRVGVQDFVLL 
ENFTSEAAFIENLRRRFRENLIYTYIGPVLVSVNPYRDLQIYSR 
QHMERYRGVSFYEEPPHLLAVADTVYRALRTERRDQAVMISVES 
GAGKTDATKRLLQLYAETCPAPQRGGAVRDRLLQSNPVLEAFGN 
AKTLRNDNSSRFGKYMDVQFDFKGAPVGGHILSYLLEKSRVVHQ 
NHGERNFHIFYQLLEGGEEETLRRLGLERNPQSYLYLVKGQCAK 
VSSINDKSDWKWRKALTVIDFTEDEVEDLLSIAASVLHLGNIH
FAANEESNAQVTTENQLKYLTRLLSVEGSTLREALTHRKIIAKG 
EELLSPLNLEQAAYARDAIAKAVYSRTFTWLVGKINRSLASKDV 
ESPSWRSTTVLGLLEIYGFEVFQHNSFEQFCINYCNEKLQQLFI 
ELTLKSEQEEYEAEGIAWEPVQYFNNKIICDLVEEKFKGII\SI
LDE\ECLRPGE 


6709 


3 


894 


PPHEHLFPSGERGFF FT GPGKT r ,t EKKGRGAEK 
TAAKMEKKVSKRSRKEEEDLEALIAHFQTLDAKRTQTVEIjPCPP 
PSPRLNASLSVHPEKDELILFGGEYFNGQKTFLYNELYVYNIRK 
DTWTKVDIPSPPPRRCAHQAWVPQGGGQLWVFGGEFASPNGEQ
FYHYKDLWVLHLATKTWEQVKSTGGPSGRSGHRMVAWKRQLILF 
GGFHESTRDYIYYNDVYAFNLDTFTWSKLSPSGTGPTPRSGCQ\ 
IPSLPFAASSVYGGYSKQRVKKDVDKGTRHSDMF 


6710 


158 


930 


RHKMTNYRVESSSGRAARKMRLALMGPAFIAAIGYIDPGNFATN 
IQAGASFGYQLLWWVWAHLMAMLIQILSAKLGIATGKNLAEQI 
RDHYPRPWWFYWVQAEIIAMATDLAEFIGAAIGFKLILGVSLL 
QGAVLTGIATFLILMLQRRGQKPLEKVIGGLLLFVAAAYIVELI 
FSQPNLAQLGKGMVIPSLPTSEAVFLAAGVL\GATIMPHVI/YI 
WHSSLTQHLHGGSRQQRYSATKWDVAIAMTIAGFVNLAIMATAA 
SELNFYGHTGVA 


6711 


3 


347 


VTECKTMTCKMSQLERNI*TMINTLHHYSVKLGHPDTLIHGEFK 
ELVRTDLHNILMKENKNDQAI*HIMEDLDTNAHMQIIFKELIML 
MAMLTWSYHDNMHDADYGPGQQHRPG 


6712 


118 


578 


PHGQKRTRYPQVRAPGQQPQAQLAMALCLKQVFAKDKTFRPRKR 
FEPGTQRFELYKKAQASLKSGLDLRSWRLPPGEWIDDWIAVHV 
VDFFNRINLIYGTMAERCS*TSCPVMAGGPRYEYRWQDERQYRR
PAKLSAPRYMALLMDWIESLI 


6713 


2485 


3 


QARGSDSEDGEFEIQAEDDARARKLGPGRPLPTFPTSECTSDVE 
PDTREMVRAQNKKKKKSGGFQSMGLSYPVFKGIMKKGYKVPTPI 
QRKTIPVILDGKDWAMARTGSGKTACFLLPMFERLKTHSAQTG 
ARALILSPTRELALQTLKFTKELGKFTGLKTALILGGDRMEDQF 
AALHENPDIIIATPGRLVHVAVEMSLKLQSVEYWFDEADRLFE
MGFAEQLQEIIARLPGGHQTVLFSATLPKLLVEFARAGLTEPVL 
IRLDVDTKLNEQLKTSFFLVREDTKAAVIjLHLLHNWRPQDQTV 
VFVATKHHAEYLTELLTTQRVSCAHIYSALDPTARKINLAKFTL 
GKCSTLIVTDLAARGLDIPLLDNV1NYSFPAKGKLFLHRVGRVA 
RAGRSGTAYSLVAPDEIPYLLDLHLFLGRSLTLARPLKEPSGVA 
GVDGMLGRVPQSWDEEDSGLQSTLEASLELRGLARVADNAQQQ 
YVRSRPAPSPESIKRAKEMDLVGLGLHPLFSSRFEEEELQRLRL 
VDSIKNYRSRATIFE1NASSRDLCSQVMRAKRQKDRKAIARFQQ 
GQQGRQEQQEGPVGPAPSRPALQEKQPEKEEEEEAGESVEDIFS 
EWGRKRQRSGPNRGAKRRREEARQRDQEFYIPYRPXDFDSERG








LSISGEGGAFEQQAAGAVLDLMGDEAQNLTRGRQQLKWDRKKKR 
FVGQSGQEDKKKIKTESGRYISSSYKRDLYQKWKQKQKID*S*L 
GRRRGILTRRRPRTEEVGEARPLAQAGCIPGPHAPRHPLQAESA 
LELKTKQQILKQRRRAQKAALSLQRWWPQAALCPQ


6714


169 


1416 


NNCQELLPPPPAPMAHIPSGGAPAAGAAPMGPQYCVCKVELSVS 

SKKFVLDYHFEEVQKLKFALFDQDKSSMRLDEHDFLGQFSCSLG 
TIVSSKKITRPLLLLNDKPAGKGLITIAAQELSDNRVITLSLAG 
RRLDKKDLFGKSDPFLEFYKPGDDGKWMLVHRTEVIKYTLDPVW 
KPFTVPLVSLCDGDMEKPIQVMCYDYDNDGGHDFIGEFQTSVSQ 
MCEARDSVPLEFECINPKKQRKKKNYKNSGIIILRSCKIKRDYS 

SAIWAVGQIIQDYDSDKMFPALGFGAQLPPDWKV3HEFAINFNP 
TNPFCSGVDGIAQAYSACLP 








GKPKHLGVPNGRMVLAVSDGELSSTTGPQGQGEGRGSSLS1HSL 
PSGPSSPFPTEEQPVASWALSFERLLQDPLGLAYFTEFLKKEFS 
AENVTFWKACERFQQIPASDT


6716 


1 


176 


GAGGPAPRSFGSKEPRAALERDKMSARAAAAKSTAMEETAIWEQ 
HTVTLHRVSLCCSK 


6717 




896 


YDYSQQGRFVPPDMMQPQQPYTGQIYQPTQAYTPASPQPFYGNN 
FEDEPPLLEELGINFDHIWQKTLTVLHPLKVADGSIMNETDLAG 
PMVFCLAFGATLLLAGKIQFGYVYGISAIGCLGMFCLLNLMSMT 
GVSFGCVASVLGYCLLPMILLSSFAVIFSLQGMVGIILTAGIIG 
WCSFSASKIFISALAMEGQQLLVAYPCALLYGVFALISVF 


6718 


290 


599 


KQSSTVPGTILPSLKWHNSGLCKFPETGGKMTTFKEGLTFKDVA 
LEKEKKLDIMKTATQ 


6719 


1 


691 


PTRPEEQDREDGKCHKMEMNPISGNLNCDPIAMSQCSSDHGCET 
DLDSDDDKIEKPNNFMKDSASQDNGLSRKISRKRVC3SDSDSSL 
QWKKSSKARTGLLRITRRCAATAANKIKLMSDVEDVSLENVHT 
RSKNGRKKPLHLACTTAKKKLSDCEGS^/HCEVPSEQYACEGKPP 
DPDSEGSTKVLSQALNGDSDSEDMLNSEHKHRHTNIHKIDAPSK 
RKSSSVTSSG 


6720 


3 


822 


HEVAEEAGGTVYPQRGTMPGTKRFQHVIETPEPGKWELTGYEAA 
VPITEKSNPLTQDLDKADAENIVRLLGQCDAEIFQEEGQALSTY 
QRLYSESILTTMVQVAGKVQEVLKEPDGGLWLSGGGTSGRMA- 1 
LMSVSFNQLMKGLGQKPLYTYLIAGGDRSWASREGTEDSALHG 
IEELKKVAAGKKRVIVIGISVGLSAPFVAGQMDCCMNNTAVFLP 
VLVGFNPVSMARHPFPPPRILRSLTVFPSLRAPHYQITSLLFSM 
SWTLISE 


6721 


3 


822 


HE\i A F, E AG3TVYPQRGTMPGTKRFQKVIETPEPGKWELTGYEAA 
VPITEKSNPLTQDLDKADAEITIVRLLGQCDAEIFQEEGQALSTY 
QRLYSESILTTMVQVAGKVQEVLKEPDGGLWLSGGGTSGRMAF 
LMSVSFNQLMKGLGQKPLYTYLIAGGDRSWASREGTEDSALHG 
IEELKKVAAGKKRVIVIGISVGLSAPFVAGQMDCCMNNTAVFLP 
VBVGFNPVSMARHPFPPPRILRSLTVFPSLRAPHYQITSLLFSM 
SWTLISE 


6722 




390 


I 3WS] ^TWQALPMAVLFLLLFLCGTPQAADNMQAIYVALGEAVE 
LPCPSPSTLHGDEHLSWFCSPAAGSFTTLVAQVQVGRPAPDPGK 
PGRESRLRLLGNYSLWLEGSKEEDAGRYWCAVLGQHHNYQNW 


6723 


173 


659 


VCQYCTARMADFGISAGQFVAWWDKSSPVEALKGLVDKLQALT 
GNEGRVSVENIKQLLQSAHKESSFDIILSGLVPGSTTLHSAEIL 
AEIARILRPGGCLFLKEPVETAVDNWSKVKTASKLCSALTLSGL 








VEVKELQREPLTPEEVQSVREHLGHESDNL 


6724


173 


659 


VCQYCTARMADFGISAGQFVAWWDKSSPVEALKGLVDKLQALT
GNEGRVSVENIKQLLQSAHKESSFDIILSGLVPGSTTLHSAEIL 
AEIARILRPGGCLFLKEPVETAVDNNSKVKTASKLCSALTLSGL 
VEVKELQREPLTPEEVQSVREHLGHESDNL


6725 






ASWELVDPTPDLQALFVQFNDQFFWGQLEAVEVKWSVRMTLCAG 
ICSYEGKGGMCSIELSEPLLKLRPRKDLVEVFFV


6726


98 


714 


HLQKMERKINRREKEKEYEGKHNSL3DTDQGKNCKSTLMTUTVG 

LFRHVIMFLRNGELLLPEGFRENQLLAQEAEFFQLKGLAEEVKS 
RWEKEQLTPRETTFLEITDNHDRSQGLRIFCNAPDFISKIKSKI
VLVSKSRLDGFPEEFSISSNIIQFKYFIK 


6727




831 


FRGMGDERPHYYGKHGTPQKYDPTFXGP 1 YNRGCTD 1 1 CCVFLL 
LAIVGYVAVGIIAWTHGDPRKVIYPTDSRGEFCGQKGTKNENKP 
YLFYFNIVKCASPLVLLEFQCPTPQICVEKCPDRYLTYLNARSS 
RDFEYYKQFCVPGFKNNKGVAEVLRDGDCPAVLIPSKPLARRCF 

RQLAMRIFEDYTVSWYWDIISLGIAMAMSLLFIILLRFLAGIMG 
RGMIIMGILVLGY


6728 


486 


935 


FCSSWLRSLADSSLSWKMFLVGLTGGIASGKSSVIQVFQQLGCA 
VIDVDVMARHVVQPGYPAHRRIVEVFGTEVLLENGDINRKVLGD 
LIFNQPDRRQLLNAITHPEIRKEMMXETFKYFLREPRTSPRGKK 
HVPSALKEADSLMRRDT 


6729 


259 




VGLTGAQSGRTASMGRDQRAVAGPALRRWLLLGTVTVGFLAQSV 
LAGVKKFDVPCGGRDCSGGCQCYPEKGGRGQPGPVGPQGYNGPP 
GLQGFPGLQGRKGDKGERGAPGVTGPKGDVGARGVS3FPGADGI 
PGHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQG 
PKGQKGEPYALPKEERDRYRGEPGEPGLVGFQGPPGRPGHVGQM 
GPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKGEKGDVGQPGPN 
GIPSDTLHPIIAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGEE 
GIM 


6730 


781 




RKFKEVAEAYEVLSNDEKRDIYDKYGTEGLNEF 


6731 




446 


GIRKRLHGAWPRVEVGCPWETRESEGVHLERPTSPLKNNDEGS 
LDIYAGLDSAVSDSASKSCVPSRNCLDLYEEILTEEGTAKEATY 
WDLQVEYGKCQLQMKELMKKFKEIQTQNFSLINEIIQSLKKNZSA 
LIKTARVE1NRKDEEI 


6732 






AQRQVCYTAATTQAAAPATRNCLPDHSGHRPTPPRSHRHHRQEN 
LGSIKPSSRSTKATSTTMAGDGRRAEAVREGWGVYVTPRAPIRE 
GRGRLAPQNGGSSDAPAYRTPPSRQGRREVRFSDEPPEVYGDFE 
PLVAKERSPVGKRTRLEEFRSDSAKEEVRESAYYLRSRQRRQPR 
PQETEEMKTRRTTRLQQQHSEQPPLQPSPVMTRRGLRDSHSSEE 
DEAS3QTDLSQTISKKTVRSIQEAPAVSEDLVIRLRRPPLRYPR 
YEATSVQQKVNFSEEGETEEDDQDSSHSSVTTVKARSR3SDESG 
DKTTRSSSQYIESFW 


6733 


613 


1311 


RSCRCVGMRSRNQGGESASDGHISCPKPSIIGNAGEKSLSEDAK 
KKKKSNRKEDDVMASGTVKRHLKTSGECERKTKKSLELSKEDLI 
QLL3IMEG3LQAREDVIHMLKTEKTKPEVLEAHYGSAEPEKVLR 
VLHRDAILAQEKSIGEDVYEKPISELDRLEEKQKETYRRMLEQL 
LLAEKCHRRTVYELEWEKHPCHTDYMNKSDDFTNLLEQERERLKK 
LLEQEKAYQARKE 


6734 


189 


551 


SAAMFPVFSGCFQELQEXNKSLELVSFEEVAVHFTWEEWQDLDD 
AQRTLYRDVMLETYSSLVSLGHCITKPEMIFKLECGAEPWIVEE 








TLNLRLSGGSKKQVFSGICHRSLVELQEVHLV 


6735




558 


KSRRAGVTKMSNPFLKQVFNKDKTFRPKRKFEPGTQRFELHKKA 
QASLNAGLDLRLAVQLPP3EDLNDWVAVHWDFFNRVI>ILiIYGTI 
XDGCT 


6736


195 


808 


MNYELNFKREMPNI KS LGLTNLNFLLKRI.S SVLPLI TDYVYFEN 
SSSNPYLIRRIEELNKTASGNVEAKWCFYRRRDISNTLIMLAD 

IRGKCSVALLNETESVLSYLDKEDTFFYSLVYDPSLKTLLADKG 
EIRVGPRYQADIPEMLLEGTFFCVFAVL 


6737


- ISC 


1209


PVIMPLHFSPGDIVRPSCCVSSSPKLRRNAHSRLESYRPDTDLS 
REDTGCNLQHISDRENIDDLNMEFNPSDHPRA3TIFLSKSQTDV 

CVALAIYYHIKNRDPDGRMLLD1FDENLHPLSKSEVPPDYDKHN 
PEQKQIYRFVP.TLFSAAQLTAECAIVTLVYLERLLTYAEIDICP 
ANWKRIVLGAILLASKWDDQAWNVDYCQILKDITVEDMNELE 
RQFLELLQFNINVPSSVYAKYYFDLRSLAEANNLSFPLEPLSRE 
RAHKLEAI SRLCEDKYKD 7 .RRSARKRSAP ADNLTLPRWS P ^1 1 S 


6738




653 


CACAttQPARAEVGAATALPVRWASGSMAPSGSLAVPLAVLVLLL 
WGAPWTHGRRSNVRVITDENWRELLSGDKMIEFYAPWCPACQNT, 
QPEW33FAEWGEDLEWIAKVDVTEQPGLSGRFIITALPTIYHC 
KDGEFRRYQGPRTKKDFINFISDKEWKSIEPVSSWF


6739 


3 


631 


SWPDMAEEEVAKLKKHLMIJ.RQEYVKLQKKLAETEKRCALLAAQ 
ANKESSSESFI SRLLAI VADLYEQEQYSDLKI KV3DRHI SAHKF 

REDDVFLTELMKLANRFQLQLLRERCEKGVMSLVNVRNCIRFYQ 
TAEELNASTLMNYCAEI IASHWVSEVE3VHKAL 


6740 


• 


631 


SWPDMAEEEVAKLEKHLMLLRQEYVKLQKKLAETEKRCALLAAQ 
ANKESSSESFISRLLAIVADLYEQEQYSDLKIKVGDRHISAHKF 
VLAARSDSWSLANLSSTKELDLSDANPEVTMTMLRWIYTDELEF 
REDDVFLTELMKLANRFQLQLLRERCEKGVMSLVNVRNCIRFYQ 


6741 


141 


960 


PLTLPFSSRARAGHTMNTSPGTVGSDPVI1ATAGYDHTVRFWQA 
HSGICTRTVQHQDSQVNALEVTPDRSMIAAAVQPVSLGYQHIRM 
YDLNSNNPNP I I S YDGVNKNIASVGFHEDGRWMYTG3EDCTARI 

DLKTDHNEQLIPEPEVSITSAHIDPDASYMAAVNSTLVPFSCLL
PLAIGILQEGEFESLARRGLLFLACQGNCYVWNLTG3IGDEVTQ 
LIPKTKIP 


6742


141 


» 


PU1 LP F S SRARAGHTMNTS PGTVGSDPVI LATAGYDHTVRFWQA 
HSGICTRTVQHQDSQVNALEVTPDRSMIAAAVQPVSLGYQHIRM 
YDLNSNNPNPIISYDGVNK1JIASVGFHEDGRWMYTGGEDCTARI 
WD1RSRKLQCQRIFQVNAPINCVCLHPKQAELIVGDQSGAIHIW 
DLKTDHNEQLIPEPEVSITSAHIDPDASYMAAVNSTLVPFSCLL 
PLAIGILQEGEFESLARRGLLFLACQGNCYVWNLTGGIGDEVTQ 
LIPKTKIP 


6743 


1 


412 


i c E KS LHLEGDPNPSAAPTSTCAPRKMPKRI S I S KQLAS VK 
ALRKCSDLEKAIATTALIFRNSSDSDGKLEKAIAKDLLQTQFRN 
FAEGQETKPKYREILSELDEHTENKLDFEDFMILLLSITVMSDL 
LQNIR 


6744


95 


1343 


RTPARNRCAGCEVLSRFSSPNKASSFALQSAGGGLPAVRALRRD 
TVGYGMDEVEQDQHEARLKELFDSFDTTGTGSLGQEELT 
DLCHMLSLEEVAPVLQQTLLQDNLLGRVHFDQFKEALILILSRT 
ISNEEHFQEPDCSLEAQPKYVRGGKRYGRRSLPEFQESVEEFPE
VTVIEPLDEEARPSHIPAGDCSEHWKTQRSEEYEAEGQLRFWNP 
DDLNASQSGSSPPQDWIEEKLQEVCEDLGITRDGHLNRKKLVSI 








CEQYGLQOTDGEMLEEVFHHLDPDGTMSVEDFFYGLFKNGKSLT 
PSASTPYRQLKRHLSMQSFDESGRRTTTSSAMTSTIGFRVFSCL 
DDGMGHASVERILDTWQEEGIEHSQEILKALDFGLDGNIKLTEL 
TLALENELLVTKNSIHQACI


6745 


1 




TFRDQGWAQRRRWLLGCASWESWEAA1AAGPGLPSSTARQQNNP 
AAGTECFAAVWARGTAMGSVLSTDSGKSAPASATARALERRRDP 
ELPVTSFDCAVCLEVLHQPVRTRCGHVFCRSCIATSLKNNKWTC 
PYCRAYLPSEGVPATDVAKRMKSEYKNCAECDTLVCLSEMRAHI 
RTCQKYIDKYGPLQELEETA 


6746


110 


492 


GATGAMAESAPARHRRKRRSTPLTSSTLPSQATEKSSYFQTTEI 
SLWTWAAIQAVEKKMESQAARLQSLEGRTGTAEKKIjADCEKMA 
VEFGNQLEGKWAVLGTLLQEYGLLQRRLENVENLLENRN 


6747


247 




EAVTFKDVAWFTEEE ^ jT KLY] [ 1 fFRNLLSVGH — 
QPFHRDTFHFLREEKFWMMDIATQREGNSVYA3VC 


6748


201 


665 


MTTFKE C it r 1 L _ T ^ j 
LSVGNQPFHQDTFHFLGKEKFWKMKTTSQREGNSGGKIQIEMET 
VPEAGPHEEWSCQQIWEQIASDLTRSQNSIRNSSQFFKEGDVPC 
QIEARLSISXVQQXPYRCNECKQ


6749


95 


719 


RREVKGGDGVCPRARGSPQSQQFPSCAGGGEGLQQSGEALDGAM 
SAGGPCPAAAGGGPGGASGSVGAPG3VSMFRWLEVLEKEFDKAF 
VDVDLLLGEIDPDQADITYEGRQKMrSLSSCFAQLCHKAQSVSQ 
INHKLEAQLVDIiKSELTETQAEKVVLEKEVHDQLLQLHSIQLQL 
HAKTGQSADSGTIKAKLSGPSVEELERELKAN 


6750


3 




ECESRRFGAKWVSASGALPRDTTGI.GSEQPSGDVAQSNRATMGT 
TAPGPIHLLELCDQKLMEFLCNMDNKDLVWLEEIQEEAERMFTR 
EFSKEPELMPKTPSQKNRRKKRRISYVQDENRDPIRRRLSRRKS 
RSSQLSSRR 


6751 


152 




PTKATEMAGASVKVAVRVRPFNSREMSRDSKCIIQMSGSTTTIV 
NPKQPKETPKSFSFDYSYWSHTSPEDINYASQKQVYRDIGEEML 
QHAFEGYNVCIFAYGQTGAGKSYTMMGKQEKDQQGIIPQLCEDL
FSRINDTTKDNMSYSVEVSYMEIYCERVRDLLNPKNKGNLRVRE 
HPLLGPYVEDLSKLAVTSYNDIQDLMDSGNKARTVAATNMNETS 
SRSHAVFNIIFTQKRHDAETNITTEKVSKISLVDLAGSERADST 
GAKGTRLKEGANINKSLTTLGKVISALAEMDSGPNKWKKKKKTD 
FIPYRDSVLTWLLRENLGGNSRTAMVA^LSPADINYDETLSTLR 
YADRAKQIRCNAVINEDPHMKLIRELKDEVTRLRDLLYAQGLGD 
ITDMTNALVGMSPSSSLSALSSRNV 


6752 




1834 


RNCVPPLGCYRSRVKFHSDIKMQYSHHCEHLLERLNKQREAGFL 
CDCTIVIGEFQFKAHRNVLASFSEYFGAIYRSTSENNVFLDOSO 
VKADGFQKLLEFIYTGTLNLDSWNVKEIHQAADYLKVEEWTKC 
KIKMEDFAFIANPSSTEISSITGNIELNQQTCLLTLRDYNNREK 
SEVSTDLIQANPKQGALAKKSSQTKKKKKAFNSPKTGQWKTVQY 
PSDILENASVELFLDANKLPTPWEQVAQINDWSELELTSWEN 
TFPAQDIVHTVTVKRKRGKSQPNCALKEHSMSNIASVKSPYEAE 
NSGEELDQRYSKAKPMCNTCGKVFSEASSLRRHMRIHKGVKPYV 
CHLCGKAFTQCNQLKTHVRTHTGEKPYKCELCDKGFAQKCQLVF 
HSRMHHGEEKPYKCDVCNLQFATSSNLKIHARKHSGEKPYVCDR 
CGQRFAQASTLTYHVRRHTGEKPYVCDTCGKAFAVSSSLITHSR
KHTGEKPFICELCGWSYTDIKNLKKHKTKVHSGADKTLDSSAED 
HTLSEQDSIQKSPLSETMDVKPSDMTLPLALPLGTEDHHMLLPV 
TDTQSPTSDTLLRSTVNGYSEPQLIFLQQLY 


6753 


2 


1305 


VPSLPYPPQKWAHTEFTTSSDSETANGIAKPDPVMPGGEEKAS
PFGIKLRRTNYSLRFNCDQQAEQKKKKRHSSTGDSADAGPPAAG 
SARGEKEMEGVALKHGPSLPQERKQAPSTRRDSAEPSSSRSVPV
AHPGPPPASSQTPAPEHDKAANKMPLAQKPALAPKPTSQTPPAS 








PLSKLSRPYLVELLSRRAGRPDPEPSEPSKEDQESSDRRPPSPP 
GPEERKGQKRDEEEEATERKPASPPLEATQQEKPSQTPEAGRKE 
KPMLQSRHSLDGSKLTEKVETAQPLWITLALQKQKGFREQQATR 
EERKQAREAKQAEKLSKEMVSVSVQPGSSSVSRAGSLHKSTALP 
EEKRPETAVSRLERREQLKKANTLPTSVTVEISYSSPAAPLVKE 
VSKRFSSPDDAPVSSEPAWLALAKRBCAKaWSDCPLIIK 


6754


2 


413 


FVRRRRRRLGGPEVNTMSSLHKSRIADFQDVLKEPSIALEKLRE 
LSFSGIPCEGGLRCLCWKILLNYLPLERASWTS1LAKQRELYAQ 
FLREMIIQPGIAKANMGVSREDVTFEDHPLNPNPDSRWNTYFKD 
NEVLL 


6755


298 


1343 


PGLQLQVALEADWFLDMPGGRRGPSRQQLSRSALPSLQTLVGGG 
CGNGTGLRNRNGSAIGLPVPPITALITPGPVRHCQIPDLPVDGS 
LLFEFLFFIYLLVALFIQYINIYKTVWWYPYNHPASCTSLNFHL 
IDYHLAAFITVMLARRLVWALISEATKAGAASMIHYMVLISARL
VLLTLCGWVLCWTLVNLFRSHSVLNLLFLGYPFGVYVPLCCFHQ 
DSRAHLLLTDYNYWQHEAVEESASTVGGLAKSKDFLSLLLESL 
KEQFNNATPIPTHSCPLSPDLIRNEVECLKADFNHRIKEVLFKTS 
LFSAYYVAFLPLCFVKVSGYLTFMCFLDLCVNYINWVFLV 


6756


180 


754 


lERALGSLPLSIPVSWGSLRTLKYQQQPLRPKVLLCQ'I'RVQCHD 
LRSLQPQPPGLKQSFCLRVLGLQTGATTPGLRDLTCKELIILTE 
REAQKRKKRKEKESGMALTQGPLTFRDVAIEFSQEEWKSIiDPVQ 
:<ALYWDVMLENYRNLVFLGKDNFALEVK1CPRVFLYFLCCLSWE 
PFHYLTETEALLTHK 


6757


2 


459 


NSRVEAPEAHSRESQGSDAMRKHbSWWWJATVCMLLFSHLSAVQ 

FGAEGNRYYEANYNQFPDGIHYNGCSEANVTKEAFVTGCINATQ 
AANQGEFQKPDNKLHQQVLW 


6758 


1 


1008 


ASGPEIj?GR:<FRDRAPWLPARLLRGVLAV>JVSLSALGP3SFCR^ 
RVPSLAQLGHSEAAPSPDDVRWSRVPDRCPEERDRAWPPPPPPS 
LPPSFRRNMANNSPALTGNSQPQHQAAAA^AQQQQQCGSGGATK 
PAVSGKQGNVLPLWGNEKTMNLNPMILTNILSSPYFKVQLYELK
TYHEWDEIYFKVTHVEPWEKGSRKTAGQTGMCGGVRGVGTGGI 
VSTAFCLLYKLFTLKLTRKQVMGLITHTDSPYIRALGFMYIRYT 
QPPTDLWDWFESFLDDEEDLDVKAGGGCVMTIGEMLRSFLTKLE 
WFSTLFPRIPVPVQKNIDQQIKTRPRKI 


6759


1 


513 


RKHNFIISLDGTSTRAFHPQTGLPLLSSPVPQRKTQSGCFDLDSS
LLHLKSFSSRSPRPCLNIEDDPDIHEKPFLSSSAPPITSLSLLG 
NFEESVLWYRFDPLGIVDGFTAEVGASGAFCPTHLTLPVEVSFY 
SVSDDKAPSPYMGVITLESLGKRGYRVPPSGTIQWCVL 


6760


239 




VLSKKKGLSAEEKRTRMMEIFSETKDVFQLKDLEKIAPKEKGIT

AMSVKEVLQSLVDDGMVDCERIGTSNYYWAFPSKflLHARKHKLE 

VLESQLSEGSQKHASLQKSIEKAKIGRCETEERT 


6761


29 


1733 


ERTLRGLREVAAPSDVADAAVSRRGRCCCCLHCTQTQVAQDCPS 
SSSSVQRCELSLFQSLHTMTSKKLVNSVAGCADDALAGLVACNP
ri Q] " C^HRVALRSDLDSLKGRVALLSGGGSGHEPAHAGFIGKG 
MLTGVIAGAVFTSPAVGSILAAIRAVAQAGTVGTLLIVKNYTGD 
RLNE -SLAREQARAEGIPVEMWIGDDSAFTVLKKftGRRGLCGTV 
^CALAEAGVGLEEIAKQVNWTKAMGTLGVSLSSCSVPG 
SKPTFELSADEVELGLGIHGEAGVRRIKMATADEIVKLMLDHMT 
NTTISHVPVQPGSSWMMVNNLGGLSFLELGIIADATVRSLEG
RGVKIARALVGTFMSALEMPGISLTLLLVDEPLLKLIDAETTAA 
AWPNVAAVSITGRKRSRVAPAEPQEAPDSTAAGGSASKRMALVL 
ERVCSTLLGLEEHLNALDRAAGDGDCGTTHSRAARAIQEWLKEG 
PPPASPAQLLSKLSVLLLEKMGGSSGALYGLFLTAAAQPLKAKT 
SLPAWSAAMDAGLEAMQKYGKAAPGDRTMLDSLWAAGQEL 


6762 


3 


613 


ASTISWRLCVAGAEARRPVPVAGERAGGGAMWFMYLLSWLSLFI 
QVAFITLAVAAGLYYLAELIEEYTVATSR11KYM1WFSTAVLIG 
LYVFERFPTSMIGVGLFTNLVYFGLLQTFPFIMLTSPNFILSCG 
LVWNHYLAFQFFAEEYYPFSEVT.AYFTFCLWIIPFAFFV'SLSA 
GENVLPSTMQPGDDWSNYFTKGKRGK 


6763


2 


760 


SGPDFPGRRFRGCCCVRPPAGAGMELGGHWDMNSAPRLVSETAE 
RKQEQKTGTEAEAADSGAVGARRFLLCLYLGGFLDLFGVSMWP 
LLSLHVKSLGASPTVAGIVGSSYGILQLFSSTLVGCWSDWGRR
SSLLACILLSALGYLLLGAATWFLFVLARVPAGIFKHTLSISH 
ALLSDVVPEKERPLVIGHFNTASGVGFILGPWGGYLTELEDGF 
YLTAFICFLVFILNAGLVWFFPRREAKPGSTE 


6764


80 


438 


LKKMDTMMLSVRNLFEQLVRRVEILSEGNEVQFIQLAKDFEDFR 
KKWQRTDHELGKYKDLLMKAETERSALDVKLKHARNQVDVEIKR 
RQRAEADCEKLERQIQLIREMLMCDTSGSIQ


6765


3 


550 


arysrvdhfcrrrcravaraprfllqfpsgpsrhflaacvarwl 
rgsvlvsealsgsamdgivtevavgvkrgsdellsgsvlsspns 
nmssmwtangndskkfkgrdkmdgapsrvlhirklpgevtete 
vialglpfgkvtnilmlkgknqaflelateeaai™gnyysavt 

PKLRNQ 


6766


1 


1287 


F ASLTWLWPLGEMKLIICEVEVISRHT.PALGLRNRGKGVR 
AVLSLCQQT3RSQPPVRAFLLISTLKDKRGTKYELRENIEQFFT 
KFVDEGKATVRI.KEPPVDICLSKANSSSLKGFLSAMRLAHRGCN 
VDTPVSTLTPVKTSEFENFKTKMVITSKKDYPLSKNFPYSLEHL 
QTSYCGLVRVDMRMLCLKSLRKLDLSHNHIKKLPATIGDLIHLQ 
ELNLNDNHLESFSVALCHSTLQKSLWSLDLSKNKIKALPVQFCQ 
LQELKNLKLDDHELIQFPCKIGQLINLRFLSAARNKLPFLPSEF 
RNLSLEYLDLFGHTFEQPKVLPVIKLQAPLTLLESSARTILHNR 
IPYGSHIIPFHLCQDLDTAKICVCGRFCLNSFIQGTTTMNLHSV
AJITWLVDNLGGTEAPI I S YFCSLGCYVNSSDI 


6767


336 


919 


APMICLCSSDLQFRYKEAFLRDRGLQIGYCSVDDDPRMKHFLNV 
GRLQSDNEYKKDFAK5RSQFHSSTDQPGLLQAKRSQQLASDVHY 
RQPLPQPTCDPEQLGLRHAQKAHQLQSDVKYKSDLNLTRGVGWT 
PPGSYKVEMARRAAELANARGLGLQGAYRGAEAVEAGDHQSGEV 
NPDATEILHVKKKKALLL


6768


2 




PGSTISCYLLSEGSLPLCMQVACGEEKHRAPTMKTLRARFKKTE
LRLSPTDLGSCPPCGPCPIPKPAARGRRQSQDWGKSDERLLQAV 
ENNDAPRVAALIARKGLVPTKLDPEGKSAFHL 


6769 


284 


396 


MSTPDFSTAEKMQELANEVSCLKAMLTIjMLQAMGQAD 


6770 




397


QRNYQVIWSSTMAKLHDYYKDEWKKLMTEFKYNSVMQVPRVEK 
ITLNMGVGEAIADKKLLDNAAADLAAISGQKPLITKARKSVAGF 
KIRQGYPIGCKVTLRGERMWEFFERLITIAVPRIRDFRGLSAKS 


6771 


3 


378


APAGTLAMTGKSVKDVDRYQAVLANLLLEEDNKFCADCQSKGPR 
WASWNIGVFICIRCAGIHRNLGVHISRVKSVNLDQWTQEQIQCM 
QEMGNGKANRLYEAYLPETFRRPQIDPYLFWSNLEG 


6772 


1 


1400 


AAAFLQGMTVNGFINTVITSL\ERRYDLHSYQSGLIASSYDIAA 
CLCLTFVSYFGGSG\HKPRWLGWGR\VLMGTGSLVFALPHFTAG 
P**GWKLDAGVRTCPANPR\PVCAG\HTSGLSRYQLVFMLGQFL 
HGVGATPLYTLGVTYLDENVKSSCSPIYIAIFYTAAILGPAAGY
L1GGALLK1YTEMGRRTELTTESPLWVGAWWVGFLGSGAAAFFT 
AVPILGYPRQLPGSQRYAVMRAAEMHQLKDSSRGEASSTPDFGKT 
IRDLPLSIWLLLKNPTFILLGLAGATEATLITGMSTFSPKFLES 
QFSLSASEAATLFGYLWPAGGGGTFLGGFFVNKLRLRGSAVIK 
FCLFCTWSLLGILVFSLHCPSVPMAGVTASYGGSLLPEGHLNL 
TAPCNAACSCQPEHYSPVCGSDGLMYFSLCHAGCPAATETNVDG 
QKVYRDCSCIPQNLSSGFGHATAGKCTST


6773


1 


630 


PWEAPKEHKYKAEEHTWLTVTGEPCHFPFQYHRQLYHKCTHKG 
RPGPQPWCATTPNFDQDQRWGYCLEPKKVKDHCSKHSPCQKGGT 
CVNMPSGPHCLCPQHLTGNKCQKEKCFEPQLLRFFHKNEIWYRT 
EQAAVARCQCKGPDAHCQRLASQACRTNPCLHGGRCLEVEGHRL 
CHC?VGYTGPFCDVGE*GSGASRRPAPRWDGLAR 


6774




389 


LTELSDQQYFLFFILSS/WVPTFLSMDVDGRVIKADSFSKIISS 
GLRIGFLTGPKPLIERVILHIQVSTLHPSTFNQLMISQ 


6775


104 


614 


TCPSQLRVLTARGGRRAPSPQLWTLVLALIEEKWRSHRILRMNS 
GRPETMENLPALYT1FQGEVAMVTDYGAFIKIPGCRKQGLVHRT 
HMSSCRVDKPSEIVDVGDKVWVKLIGREMKNDRIKVSLSMKWN 
QGTGKDLDPNNV\SLSKKRGGGDPSRITLGRRSPLRLS 


6776


3 


1108 


HERHERHEGALSQDALLRIS 1 PLDSNMRPEKCRRFVHPQWQLLH 
LNGTFPNTSDADMEPCVDGWVYDRISFSSTIVTEWDLVCDSQSL 
TSVAXFVFMAGMMVGGILGGHLSDRFGRRFVLRHCYLQVAIVGT 
CAALAPTFLIYCSLRFLSGIAAMSLITNTIMLIAEWATHRFQAM 
GITLGMCPSGIAFMTLAGLAFA.RDWHILQLWSVPYFVIFLTS 
SWLLESARVILIINNKPEEGLKELRKAAHRSGMKNARDTLTLEIL 
I I' ' P'ELEAAQKKKP FLGERLHMPNI CKRI S LLPFTKFANFMA 
YFGLNIjHG/LKHLGNNVFLLQTLFGAV/TPPGQLVLHLGHWGSG 
RVSSRGRVNCLGLFVLQVW


6777 


779 


63 


cffhgpawrdcevratfakkqgqsgiikciafspaqplyacgsy 

GRSLGLYAWDDGSPLALLGGHQGGITHLCFHPDGNRFFSGARKD 
AELLCWDLRQSGYPLWSLGREVTTNQRIYFDLDPTGQFLVSGST 
SGAVSVWDTDGPGNDGKPEPVLSFLPQKDCTNGVSLHPSLPLLG 
HCLPVSVCFLSPTESGGRRRGAGPSLGSPRRHVHLECRLQLWWC 
GGGARLQHP**SPRARKGR 


6778 


311 


805 


IQSITDESRGSIRRKWPANTRLRLNVP\EETAGOSE/ERSPEEE 
VQADPRIRSASPKCPTSSPFPKGRSPEGEGET\DPEKVHFHPGP 
KDKSVAEKN\KGP\SPVSSEGIKDFPSMKPEWENLNQSNVRRME 
T\AVRLNEVIVKKSRDAKLVLLNMPGPPRNRNGDENY 


6779 


2 


535 


RALRRQPRLLAANGIEPESMAISEPIKGSRKPCVNKEELALKKP
MAKCAWKGFREPPQDARAEAESPGGASESDQDGGHEgpPKKKAV 
AWVSAKNPAPMRKKKKVSLGPVSYVLVDSEDGRKKPVMPKKGPG 
SRREASDQKAPRGQCPAEATASTSRGPKAKPEGSPRRATNESRK 
V 




3 


403 


HEWDNKPEININLMSPGKEEISYIFEGDPIDTFVALVRVQDKD 
SGLNGEIVCKLHGHGHFKLQKTYENNYLILTNATLDREKRSEYS 
LTVIAEDRGTPSLSTVKHFTVQINDINDNPEHFQRSRYEFVISE 
K 


6781 




1269 


APTRPVFPTLQDLSSSKEPSNSLNLPHSNELCSSLVHPELSEVS 
SNVAPSIPPVMSRPVSSSSISTPLPPNQITVFVTSNPITTSANT 
SAALPTHLQSALMSTWTMPNAGSKVMVSEGQSAAQSNARPQFI 
TPVFINSSSIIQVMKGSQPSTIEAAPLTTNEGLMPPSVAWGPL
HIPQNIKFSSAPVPPNALSSSPAPNIQTGRPLVLSSRATPVQLP 
SPPCTSSPWPSHPPVQQVKELNPDEASPQVNTSADQNTLPSSQ 
STTMVSPLLTNSPGSSGNRRSPVSSSKGKGKVDKIG2ILLTKAC 
KKVTGSLEKGEEQYGADGETEGQGLDTTAPGLMGTEQLSTELDS 
KTPTPPAPTLLKMTSSPVGPGTASAGPSLPGGALPTSVRSIVTT 
LVPSELISAVPTTKSNHGGIASESLAG 


6782 


3 


1327 


- . . I PAKPGKCLHEDPQSPPPLPAEKP IGNTFSTVSGKLS 
NVERTRNLESNHPGQTGGFVRVPPRLPPRPVKGKTIPTQQPPTK 
VPPERPPPPKLSATRRSNKKLPFNRSSSDMDLQKKQSNLATGLS 
KAKSQVFKNQDPVLPPRPKPGHPLYSKYMLSVPHGIANEDIVSQ 
NPGELSCKRGDVLVMLKQTENNYLECQKGEDTGRVHLSCMKLIT 
PLDEHLRSRPNPFSPPKAPSHAQKPVDSGAPHAWLHDFPAEQV
DDLNLTSGEIVYLLEKIDTDWYRGNCRNQIGIFPANYVKVIIDI
PEGGNGKRECVSSHCVKGSRCVARFEYIGEQKDELSFSEGEIII 
LKEYVNEEWARGEVRGRTGIFPLNFVEPVEDYPTSGANVLSTKV 
PLKTKKEDSGSNSQVNSLPAEWCEALHSFTAETSDDLSFKRGDR 




3 


1750 


SYHHHHAQQSAAAS PNLTASOKTVTTTSMITTKTLPLVLKAATA 
TMPASWGQRPTIAMVTAINSQKAVLSTDVQNTPVNLQTSSKVT 
GPGAEAVQIVAKNTVTLQVQATPPQPIKVPQFIPPPRLTPEPNF
LPQVHPKPVAQNMIPIAPAPPPMLAAPQLIQRP\'MLTKFTPTTL 
PTSQNSIHPVRWNGQTATIAKTFPMAQZjTSIVIATPGTRLAGP 
QTVQLSKPSLEKQTVKSHTETDEKQTESRTITPPAAPKPKREEN 
PQKLAFMVSLGLVTHDHLEEIQSKKQERKRRTTANPVYSGAVFE 
PERKKSAVTYLNSTMHPGTRKRGRPPKYNAVLGFGALTPTSPQS 
SHPDSPENEKTETTFTFPAPVQPVSLPSPTSTDGDIHEDFCSVC 
RKSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLK 
KEEAIPWPGTLAIVHSYIAYKAAKEEEKQKLLKWSSDLKQEREQ 
LEQKVKQLSNSISKCMEMKNTILARQKEMHSSLEKVKQLIRLIH 
GIDLSKPVDSEATVGAISNGPDCTPPANAATSTPAPSPSSQSCT 


6784 


3 


1750 


TMPASWGQRPTIAMVTAINSQKAVI.STDVQNTPVNLQTSSKVT 
GPGAEAVQIVAKNTVTLQVQATPPQPIKVPQFIPPPRLTPRPNF
LPQVRPKPVAQNNIPIAPAPPPMLAAPQLIQRPVMLTKFTPTTL 
PTSQNSIHPVRWNGQTATIAKTFFMAQLTSIVIATPGTRLAGP 
QTVQLSKPSLEKQTVKSHTETDEKQTESRTITPPAAPKPKREEN 
PQKLAFMVSLGLVTHDHLEEIQSKRQERKRRTTANPVYSGAVFE 
PERKKSAVTYLNSTMHPGTRKRGRPPKYNAVLGFGALTPTSPQS 
SHPDSPENEKTETTFTFPAPVQPVSLPSPTSTDGDIHEDFCSVC 
RKSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLK 
KEEAIPWPGTLAIVHSYIAYKAAKEEEKQKLLKWSSDLKQEREQ 
LEQKVKQLSNSISKCMEKKNTILARQKEMHSSLEKVKQLIRLrH 
GIDLSKPVDSEATVGAISNGPDCTPPANAA.TSTPAPSPSSQSCT 
ANCNQGEETK 


6785


1 


528 


-iGNrVLHYCSMYSKPECLKLLLRSKPTVDIVNQAGETALDIAKR 
LKATQCEDLLSQAKSGKFNPHVHVEYEWNLRQEEIDESDDDLDD 
KPSPVKKERSPRPQSFCHSSSISPQDKLALPGFSTPRDKQRLSY 
GAFTNQIFVSTSTDSPTSPTTEAPPLPPRNAGKGPTGPPITPHR 


6786 


1820 


1397 


t O.VJ 2j?IRELANHVSRDFKDI\TRKLTVARFYGGTSYQSQ 
INHIRNGIDILVGTPGRIKDHLQSGRLDLSKLRHVVLDEVDQML 
DLGFAEQVEDIIHESYKTDSEDNPQTLLFSATCPQWVYTVA\KK 
YMKSRYEQVDLDGKMTQKAATTVEHLAIQCHWSQRPAVIGDVLQ 
VYSGSEGRAIIFCETKKNVTEMAMNPHIICQNAQCLHGDIAQSQR 
E I TLKGFREGS FKViVATNVAARGLDI PEVDLVIQSS PPQDVES 
YIHRSGRTGRAGRTGICICFYQPRERGQLRYVEQKAGITFKRVG 
VPSTMDLVKSKSMDAIRSLASVSYAAVDFFRPSAQRLIEEKGAV 
DALAAALAHISGASSFEPRSLITSDKGFVTMTLESLEEIQDVSC 
AWKELNRKLSSNAVSQITRMCLLKGNMGVCFDVPTTESERLQAE 
WHDSDWILSVPAKLPEIEEYYDGNTSSNSRQRSGWSSGRSGRSG 
RSGGRSGGRSGRQSRQGSRSGSRQDGRRRSGNRNRSRSGGHKRS 
FD*VFYHLVDFLSDFLVDSVYLTGRQIDHLTGLTGLIDHLTSHS 
SVWN 


6787 


2646 




PSSFPKNVPLEELEEPPK*KRSGLGSLTPKSQIQNGP*PQTFFF 
FELGSPSGVISAHCNLRLLGSSDSPAPASRVAGIIGTCHHAWLI 
LVFLVEMGFHHVGQAGLKLLTL\VIHPPWPPKVLGLQT 


6788 


16 


936 


GGTVDLR\DMLAVSVIjAAVRGGR/ATVRRVRESMVLHEK8KGK.T 
REGAEDKMTSGDVLSNRKMFYLLKTAFPSVQINTEEHVD\ELDQ
3 _L GS*DS*GYPKGK*LLPKEVPSR/RVLLSGLTPLDATQE\ 
FTEDLSK\YVTTMVCVAVNGKPMLGVIHKPFSEYTAWAMVDGGS 
NVKAESSYNEKTPRIWSRSHSGMVKQVALQTFGNQTTIIPAGG 
AGYKVLALLDVPDKSQEKADLYIHVTYIKKWDICAGNAILKALG
GHMTTLSGEEISYTGSDGIEGGLLASIRMNHQALVRKLPDLEKT
GHK 


6789


2 


678 


GNGINVLKIAPESAIKFMAYEQIKRLVW**PGDS*GF/YERLVA 
GSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARRILARE 
GVAAFYKGYVPNMLGIIPYAGIDLAVYETLKNAWLQHYAVNSAD 
PGVFVLLACGTMSSTCGQLASYPLALVRTRMQAQASIEGAPEVT 
MSSLFKHILRTEGAFGLYRGLAPNFMEVIPAVSISYWYENLKI
TLGVQSR 


6790 


2 


4068 


APPAGRRRMQAAPRAGCGAALLLWIVSSCLCRAWTAPSTSQKCD 
EPLVSGLPHVAFSSSSSISGSYSPGYAKINKRGGAGGWSPSDSD 
HYQWLQVDFGNRKQISAIATQGRYSSSDWVTQYRMLYSDTGRIIW 
KPYHQDGNIWAFPGNINSDGWRHELQHPI IARYVRIVPLDWNG 
EGRIGLRIEVYGCSYWADVINFDGHWLPYRFRNKKMXTLKDVI 
ALNFKTSESEGVILHGEGQQGDYITLELKKAKLVLSLNLGSNQL
GPIYGHTSVMTGSLLDDHHWHSWIERQGRSINLTLDRSMQHFR 
TNGEFDYLDLDYEITFGGIPFSGKPSSSSRKNFKGCMESINYNG 
VNITDLARRKKLEPSNVGNLSFSCVEPYTVPVFFNATSYLEVPG 
RLNQDLFSVSFQFRTWNPNGLLVFSHFADNLGNVEIDLTESKVG 
VHINITQTKMSQIDISSGSGLNDGQWHEVRFLAKEWFAILTIDG 
DEASAVRTNSPLQVKTGEKYFFGGFLNQMNNSSHSVLQPSFQGC 
MQIjIQVDDQLVHLYEVAQRKPGSFANVSIDMCAIIDRCVPNHCE 
HGGKCSQTWDSFKCTCDETGYSGATCHNSIYEPSCEAYKHLGQT 
SNYYWIDPDGSGPLGPLKVYCNMTEDKVWTIVSHDLQMQTPWG 
YNPEKYSVTQLVYSASMDQISAITDSAEYCEQYVSYFCKMSRLL 
NTPDGSPYTWWVGKANEKHYYWGGSGPGIQKCACGIERNCTDPK 
YYCNCDADYKQWRKDAGFLSYKDHLPVSQVWGDTDRQG3EAKL 
SVGPLRCQGDRNYWNAASFPNPSSYLHFSTFQGETSADISFYFK 
TLTPWGVFLENMGKEDFIKLELKSATEVSFSFDVGNGPVEIWR 
SPTPLNDDQWHRVTAEROTKQASLQVDRLPQQIRKAPTEGHTRL 

SGCSGHCTSYGTNCENGGKCLERYHGYSCDCSNTAYDGTFCNKD 
VGAFFEEGMWLRYNFQAPATNARDSSSRVDNAPDQQNSHPDLAQ 
EEIRFSFSTTKAPCILLYISSFTTDFLAVLVKPTGSLQIRYNLG 
GTREPYNIDVDHRNMANGaPHSVWirRHEKTIFLKLDHYPSVSY 
HLPSSSDTLFNSPKSLFLGKVIETGKIDQEIHXYNTPGFTGCLS 
RVQ FNQ 1 APLKAALRQTNASAHVHI QGELVESNCGASPLTLS PM 
SSATDPWHLDHLDSASADFPYNPGQGQJURNGVNRNSAIIGGVI 
A\ WI FTPSLCTP\VLP * SR*HVSPHKGTLPI PNEAKGAGSRQK 
KPGRRPSMNNDPPTSQRPIDESKKEWPHLRGGYLAMG 


6791


1801 


1193 


i a KGEKGDKGDLGPRGERGQHGPKGEKGYPGIPPEL/PGW 
SAW*SWLTAASTKVQAILLPQPLE*LGLQIAFMASLATHFSNQ 
NSGIIFSSVETNIGNFFDVMTGRFGAPVSGVYFFTFSMMKHEDV 
EEVYVYLMHNGMTVFSMYSYEMKGKSDTSSNHAVLKLAKGDEVW 
LRMGNGALHGDHQRFSTFAGFLLFETK 


6792 


33 


1073 


VRHTNKGVDMYLFSLGSE8PKGAIGHIVSTEKTILAVERNKVLL 
P PLWNRTFS WGFDDFS CCLGS YGSD.KVLMTFENIAAWGR CLCAV 
CPSPTTIVTSGTSTWCVWELSMTKGRPRGLRLRQALYGHTQAV 
TCLAASVTFSLLVSGSQDCTCILWDLDHLTHVTRLPAHREGISA 
ITISDVSGTIVSCAGAHLSLWNVNGQPLASITTAWGPEGAITCC
CLMEGPAWDTSQIIITGSQDGMVRWKT/VGCEDVCSWTASRRG 
APGSASKPKRPQVGEEPGLESRAGR*HCFDREAQQNQP\PVTAL 
AVSRNHTKLLVGDERGRI?CWSADG*EERGSRGSGTTVPG


6793


2340 


80S 


GRKEANY\YGSLTQAGTVSL,GIiDAEGQEVFVPFSAVLPMVAPND 
LVFDGWDISSLNLAEAMRRAKVLDWGLQEQLWPHMEALRPRPSV 
YIPEFIAANQSARADNLIPGSRAQQLEQIRRDIRDFRSSAGLDK 
VIVLWTANTERFCEVIPG1NDTAENLLRTIELGLEVSPSTLFAV 
ASILEGCAFLNGSPQNTLVPGALELAWQHRVFVGGDDFKSGQTK
VKSVLVDFLIGSGLKTMSIVSYNHLGNNDGENLSAPLQFRSKEV 
SKSNWDDMVQSNPVLYTPGEEPDHCWIKYVPYVGDSKRALDE 
YTSELMLGGTNTLVLHMTCEDSLLAAPIMLDLALLTELCQRVSF 
CTDMDPEPQTFHPVLSLLSFLFKAPLVPPGSPWNALFRQRSCI 
ENILRACVGLPPQNHMLLEHKMERPGPSLKRVGPVAATYPMLNK 
KGPVPAATNGCTGDANGHLQEEPPMPTT*GPGHTVSRLFLPAAP 
HDPTLKAPTNKGRCHFSPPSTWGSWGL


6794






EPAPPAPAPPEDHPDEEMGFTIDIKSFLKPGEKTYTQRCRLFVG 
NLPTDITEEDFKRLFERYGEPSEVFINRDRGFGFIRLESRTLAE 
IAKAELDGTILKSRPLRIRFATHGAALTVKNLSPVVSNELLEQA 
FSQFGPVEKAWWDDRGEATGKGFVEFAAKPPARKALERCGDG 
AFLLTTTFRPVIVEPMEOFDDEDGLPEKLMQKTQQYHKEREQPP 
RFAQPGTFEFEYASRWKALDFHFI ( - 1 -RH j-EAKEKLEAE 
MEAARHEHQLMLMRQDLMRRQEELRRLEELRNQELQKRKQIQLR 
HEEEHRRREEEMIRHREQEELRRQQEGFKPNYMENYVCHFLR 


6795 


1740 


1010 


I . >T JVRDIIELDSF-*DWAAQETDCAQNSGERL*K3V/LENFS 
TKSKSAVKISLDLLSNPLCEQDQDLLNMVTALDTAMKRMDAFNQ 
EKVNQIQKTVIEPLKKFGSVFPSLNMAVKRREQALQDYRRLQAK 
VEKYEEKEKTGPVLAKLHQAREELRPVREDFEAKWRQLLEEMPR 
FYGSRLDYFQPSFESLIRAQWYYSEMHKIFGDLSHQLDQPGHS 
DEQRERENEAKLSELRALSIVADD


6796


48 


683 


GKE I Q I P TI KLAWLLFGLE * PVGALGKGWSF* * SHVALGQLGW 

SVQEGRDCWQR*LPRLFSALVGQPGCWPQGAPPERCV*PGRCKW 
HLQSQVLR*ERRRCCRCLPRFA*GWRRRHQRLGLGIHPAPLGST 
SPPHPEGNSQQCRR*GWAAELRLPSSWL*GKLGC* 


6797 


1620 


211 


TERMTPSQPTRGSSCTRFSSMLWTSTWRCLTCKWAGMRMSWGV 
TLGPMAQGLLSASGTTTEATWTRPTTHLTLIRWWLLTASRVDPP 
ERPPPPPSDDLTLLESSSSYKNL/DAQIPQ/DWSMSPSTSG*RP 
LTSRASSIMRSRTAIPSAS*SRLTTKHTVGGSPSAWRPRPTSRS 
VSTPVSSSTETTASGSCLTWWSSSPAPCPSSSAPAHSFEASCCK 
TSLWGSCGGSGDGSSACGSGWNLSMAGTSCSSPAMCSPSRAPS* 
RSASRPRTWRATTSAASSWAPRRCWCGWA*SAT*PSSTTTISSS 
PHCGWPCPASCASAAAWLSSTWATASVAGSCWGPIM*SSAHSPW 
CLSACSRSSMGTTCL*RSPP\SGASRAAAAWCGSSPSSTFTPSS 
ASSSTWCSASSSRSSPAPTTPSSIPAAQAQRRASCRPTSHSART 
APPPASSAAGAARPAAFSAAAEGTPRRSIRCW


6798 


3894 


1696 


STISWESLESWLNKATNPSNRQEDWEYIIGFCDQINKELEG*VS 
ALWGQLRGSGLGRGTTMAKEGQPGSPRLSALECVLLVPQ\PQIA 
VRLLAHKIQSPQEWEALQALTYLGDRVSEKVKTKVIELLYSWTM 
ALPEEAKIKDAYHMLKRQGIVQSDPPIPVDRTLIPSPPPRPKNP 
VFDDEEKSKLLAKLLKSKNPDDLQEANKLIKSMVREDEARIQKV
TKRLHTLEEVNNNVRLLSEMLLHYSQEDSSDGDRELMKELFDQC 
ENKRRTLFKLASETEDNDNSLGDILQASDNLSRVINSYKTIIEG 
QVINGEVATLTLPDSEGNSQCSWQGTLIDLAELDTTIJSIiSSVLA 
PAPTPPSSGIPILPPPPQASGPPRSRSSSQAEATLGPSSTSNAL 
SWLDEELLCLGLADPAPNVPPKESAGNSQWHLLQREQSDLDFFS 
PRPGTAACGASDAPLLQPSAPSSSSSQAPLPPPFPAPWPASVP 
APSAGSSLFSTGVAPALAPKVEPAVPGHHGLALGNSALHHLDAL 
DQLLSEAKVTSGLVKPTTSPLIPTTTPARPLLPFSTGPGSPLFQ
PLSFQSQGSPPKGPELSIASIHVPLESIKPSSALPVTAYDKNGF 
RILFHFAKECPPGRPDVLVWVSMLNTAPLPVKSIVLQAAVPKS 
MKVKLQPPSGTELSPFSPIQPPAAITQVMLLANPLKEKVRLRYK 
LTFALGEQLSTEVGEVDQFPPVEQWGNL 


6799 


3894 


1696 


STISWESLESWLNKATNPSNRQEDWEYIIGFCDQINKELEG*VS 
ALWGQLRGSGLGRGTTMAKEGQPGSPRLSALECVLLVPQ\PQIA 
VRLIAHKIQSPQEWEALQALTYLGDRVSEKVKTKVIELLYSWTM 
ALPEEAKIKDAYHMLKRQGIVQSDPPIPVDRTLIPSPPPRPKNP 
VFDDEEKSKLLAKLLKSKNPDDLQEANKLIKSMVREDEARIQKV 
TKRLHTLEEVNNNVRLLSEMLLHYSQEDSSDGDRELMKELFDQC 
ENKRRTLFKLASETEDNDNSLGDILQASDNLSRVINSYKTIIEG 
QVINGEVATLTLPDSEGNSQCSNQGTLIDLAELDTTNSLSSVLA 
PAPTPPSSGIPILPPPPQASGPPRSRSSSQAEATLGPSSTSNAL 
SWLDEELLCLGLADPAPNVPPKESAGNSQWHLLQREQSDLDFFS 
PRPGTAACGASDAPLLQPSAPSSSSSQAPLPPPFPAPWPASVP 
APSAGSSLFSTGVAPALAPKVEPAVPGHHGLALGNSALHHLDAL 
DQLLEEAKVTSGLVKPTTSPLIPTTTPARPLLPFSTGPGSPLFQ 
PLSFQSQGSPPKGPELSLASIHVPLESIKPSSALPVTAYDKNGF 
RILFHFAKECPPGRPDVLVVWSMLNTAPLPVKSIVLQAAVPKS 
MKVKLQPPSGTELSPFSPIQPPAAITQVMLLANPLKEKVRLRYK 
LTFALGEQLSTEVGEVDQFPPVEQWGNL 


6800 


404 


1646 


RRSPSTGLSPVPQPSSPSLSDYSIPWSLLLSGTIAWATPGK*AG 
*PQAW*LGIiAPAIAFl/GLTRGRKQNKEKMAEGGSGDVDDAGDC 
SGARYNDWSDDDDDSNESKSIVWYPPWARIGTEAGTRARARARA 
RATRARRAVQKRASPNSDDTVLSPQELQKVLCLVEMSEKPYILE 
AALIALGNNAAYAFNRDIIRDLGGLPIVAKILNTRDPIVKEKAL
IVLNNLSVNAENQRRLKVYMNQVCDDTITSRLNSSVQLAGLRLL 
TNMTVTNEYQHMLANSISDFFRLFSAGNEETKLQVLKLLLNLAE 
NPAMTRELLRAQVPSSLG\SLFNKKENKEVILKLLVIFENINDN 
FKWEENEPTQNQFGEGSLFFFLKEFQVCADKVLGIESHHDFLVK 
VKVGKFMAKLAEHMFPKSQE 


6801 


2 


1755 


SAEEFESQQASVTMHDVDAESFEVLVDYCYTGRVSLSEANVERL 
YAASDMLQLEYVREACASFLARRLDLTNCTAILKFADAFGHRKL
RSQAQSYIAQNFKQLSHM3SIREETLADLTLAQLLAVLRLDSLD 
VESEQTVCHVAVQWLEAAPKERGPSAAEVFKCVRWMHFTEEDQD 
YLEGLLTKPIVKKYCLDVIEGALQMRYGDLLYKSLVPVPNSSSS 
/R*QQQLSCICSRKSTPETGYVCQGDGDLLWTPQRSLS\RYDPY 
SGDIYTMPSPLTSFAHTKTVTSSAVCVSPDHDIYLAAQPRKDLW 
VYKPAQNSWQQLADRLLCREGMDVAYLWGYIYILGGRDPITGVK 
LKEVECYSVQRNQWALVAPVPHSFYSFELIWQNYLYAVNSKRM 

PARGEKRRISNIPLDSETHNY^ 

VTVYEYDTREDQWINIGTMLGLLQFDSGFICLCARVYPSCLEPG 
QSFITEEDDARSESSTEWDLDGFSELDSESGSSSSFSDDEVWVQ 
VAPQPJMAQDQQGSL 


6802 


157 


1341 


ETFPLFFFLLSKTPGKTASMAHFVQGTSRMIAAESSTEHKECAE 
PSTRKNLMNSLEQKIRCLEKQRKELLEWQQWDQQFRSMKELYE 
RKVAELKTKLDAAERFLSTREKDPHQRQRKDDRQREDDRQRDLT 
RDRLQREEKEKERLNEELHELKEENKLLKGKNTLANKEKEHYEC 
EIKRLNKALQDALNIKCSFSEDCLRKSRVEFCHEEKRTEMEVLK 
QQVQIYEEDFKKERSDRERLNQEKEELQQINETSQSQLNRLNSQ 
IKACQMEKEKLEKQLKQMYCPPCNCGLVFHLQDPWVPTGPGAVQ 
KQREHPPDYQWYALDQLPPDVQHKAN/DWCLAPPPVCCQAG/PR 
TPGLK*SSCLWLPKC*NFRFILSKESPSVEVHTNRERQQATRER 
G 


6803 


1 


2203 


KLSGRPYREMGVLGTSKLYDIRKTIFTFTPQFIDQQQFYLALDN
KMIVEMLRTDLSYLCSRWRMTGQPTITFPISHSMLDEDGTSLNS 
SILAALRKMQDGYFGGARVQTGKLSEFLTTSCCTHLSFMDPGPE 
GKLYSEDYDDNYDYLESGNWMNDYDSTSHARCGDEVARYLDHLL 
AHTAPHPKIAPTSQKGGLDRFQAAVQTTCDLMSLVTKAKELHVQ 
NVHMYLPTKLFQASRPSFNLLDSPHPRQENQVPSVRVEIHLPRD 
QSGEVDFKALVLQLKETSSLQEQADILYMLYTMKGPDWNTELYN 
ERSATVRELLTELYGKVGEIRHWGLIRYISGILRKKVEALDEAC 
TDLLSHQKHLTVGLPPEPREKTISAPLPYEALTQLIDEASEGDM 
SISILTQEIMVYLAMYMRTQPGLFAEKFRLRIGLIIQVMATELA 
HSLRCSAEEATEGLMNLSPSAMKNLLHHILSGKEFGVERK/SVR 
PTDSNVSPAISIHEIGAV3ATKTERTGIMQLKSEIKQVEFRRLS 
ISAESQSPGTSMTPSSGSFPSAYDQQSSKDSRQGQWQRRRRLDG 
ALNRVPVGFYQKVWKVLQKCHGLSVEGFVLPS3TTREMTPGEIK 
FSVHVES\VLWLLRPEYRQLLVEAILVLTMLADIEIHSIGSII 
AVEKIVHIANDLFLQEQKTLGP\DDTMLAKDPASG\ICTLR\YD 
SAPSGRFGTMTYLS\RAA\ATYVQEFLP\HSICAMQ


6804


> 


951 


JSPflKP FFKAKNKESLCMENSSNSSSDEDEEETKAKMTPTKKYN 
GLEEKRKSLRTTGFYSGFSEVAEKRIKLLNWSDERLQIJSRAKDR 
KDVWSSIQGQWPKKTLKELFSDSDTEAAASPPHPAPEEGVAEES 
LQTVAEEESCSPSVELEKPPPVUVDSKPIEEKTVEVNDRKAEFP 
SSGSNFSA* IPLPYLHLNRLHQSL*QKGSRQQSSVTVSEPLAPN 
QEEVRS IKS ETDSTI EVDS VAGELQDLQSERE * LASRF* CQCEL 
KQ**SARTRTS*KSLYRSEKSERCSGRRKFIKKAEKKP*SNSGK 
QQKEGKRHK 




1539 


206 


RQPDLKYFGKSFDVSVSESSSLLSNDLPKFADGIKARNRNQNYL 
VPSPVLRILDHTAFSTEKSADIVICDEECDSPESVNQQTQEESP 
IEVHTAEDVPIAVEVHAISEDYDIETENNSSESLQDQTDEEPPA 
KLCKILDKSQADNVTAQQKWPLLRANSSGLYKCELCEFNSKYFS 
DLKQHMILKHKRTDSNVCRVCKESFSTNMLLIEHAKLHEEDPYI 
CKYCDYKTVIFENLSQHIADTHFSDHLYKCEQCDVQFSSSSELY 
LHFQBHSCDEQYLCQFCEHETNDPEDIiHSHWNEHACKLIELSD 
KYNNGEHGQYSLLSKITFDKCKHFFVCQVCGFRSRLfflTNWRHV 
AIEHTKIFPHVCDDCGKG?SSMLE\IAKHLNSHLSEGIYLCQYW 
EYSTGQIEDLKIHLDFKHSADLPHKCSDCLMRFGNERELISHLP 
VHETT 


6806




3794 


VALCFPNSDPVMFMDAFYGCLLAELGPVPIEVPLTRKDAGSQQV 
GFLLGSCGVFLALTTDACQKGLPKAQTGEVAAFKGWPPLSWLVI 
DGKHLAKPPKDWHPLAQDTGTGTAYIEYKTSKEGSTVGVTVSHA 
SLLAQCRALTQACGYSEAETLTNVLDFKRDAGLWHGVLTSVMNR 
MHWSVPYALMKANPLSWIQKVCFYKARAALVKSRDMHWSLLAQ 
RGQRDVSLSSLRMLIVADGANPWSISSCDAFLKVFQSRGLRPEV 

TEEKLSVLTVQDVGQVMPGANVCWKLEGTPYLCKTDEVGEICV 
SSSATGTAYYGLLGITKNVFEAVPVTTGGAPIFDRPFTRTGLLG 
FIGPDHLVFIVGKLDGLMVTGVRRHNADDWATALAVEPMKFVY
RGRIAVFSVTVLHDDRIVLVAEQRPDASEEDSFQWMSRVLQAID 
SIHQVGVYCLALVPANTLPKAPLGGIHISETKQRFLEGTLHPCN 
VLMCPHTCVTNLPKPRQKQPEVGPASMIVGNLVAGKRIAQASGR
7LT-.-L3DSDQARKFLFLADVLQWRAHTTPDHPLFLLLNAKGTVT 
STATCVQLHKRAERVAAALMEKGRLSVGDHVALVYPPGVDLIAA 
FYGCLYCGCVPVTVRPPHPQNLGTTLPTVKMIVEVSKSACVLTT 
QAVTRLLRSKEAAAAVDIRTWPTILDTDDIPKKKIASVFRPPSP 
DVLAYLDFSVSTTGILAGVKMSHAATSALCRSIKLQCELYPSRQ
IAICLDPYCGLGFALWCLCSVYSGHQSVLVPPLELESNVSLWLS 
AVSQYKARVTFCCYSVMEMCTKGLGAQTGVLRMKGTOLSCVRTC 
MWAEERP\RIALTQSFSKLFKDLGLPARAVSTTFGCRVNVAIC
LQGTAGPDPTTVYVDMRALRHDRVRLVERGSPHSLPLME3GKIL 
PGVKVIIAHTETKGPLGDSHLGEIWVSSPHNATGYYTVYGEEAL 
HADHFSARLSFGDTQTIWARTGYLGFLRRTELTDASGGRHDALY 
WGSLDETLELRGMRYHPIDIETSVIRAHRSIAECAVFTWTKLL 
WWELDGLEQDALDLVALVTNWIjEEHYLWGVWIVDPGVIP 
INSRGEKQRMHLRDGFLADQLDPIYVAYNM 


6807 


1444 


606 


VGHDTVHAMFTCFPKCLGFSPPVWVTVSPRSEESHTTTVSGGNG 
. SVFQAGPQLQALANLEARRGSIGAALSSRDVSGLPVYAQSGEPR 
RLTQAQVAAFPGENALEHSSDQDTWDSLRSPGFCSPLSSGGGAE 
SLPPGGPGHAEAGHLGKVCDFHLNHQCPSPTSVLPTEVAAPPLE 
KILSVDSVAVDCAYRTVPKPGPQPGPHGSLLTEGCLRSLSGDLN 
RFPCGMEVHSGQRELESWAVGEAMA\LKFPMGAMSYCLRDRSR 
FLFRLPMGLSCPLQVQ 


6808


2053


737 


GVGSGAASALARSRPLASRLSSRRRTRAPRSGAMQRLAMDLRML 
SRELSLYLEHQVRVGFFGSGVGLSLILGFSVAYAFYYLSSIAKK 
PQLVTGGESFSRFLQDHCPWTETYYPTVWCWFGRGQTLLRPF\ 
ITSKPPVQYRNELIKTADGGQISliDWFDNDNSTCYMDASTRPTI 
LLLPGLTGTSKE S Yl LHMIHLS EELG YRCWFNNRGVAGENLLT 
PRTYCCANTEDLETVIHHVHSLYPSAPFIiAAGVSMGGMLLLNYL 
C I 3 -KTlLMAAATFSVGWNTFACSSSLEKPLNWLLFNYYLTTC 
LQS3VNKHRHMFVKQVDMDHVMKAKSIREFDKRFTSVMFGYQTI 
DDYYTDASPSPRLKSVGIPVLCLNSVDDVFSPSHAIPIETAKQN 
PNVALVLTSYGGHIGFLEGIWPRQSTYMDRVFKQFVQAMVEHGH 




939 


55 


DYSGQTPVPTEHGMTLYTPAQTHPEQPGSEASTQPIAGTQTVPQ 
TDEAAQTDSQPLHPSDPTEKQQPKRLHVSNIPFRFRDPDLRQMF 
GQFGKILDVEIIFNERGSKGFGFVTFETSSDADRAREKLNGTIV 
EGRKIEVNNATARVMTNKKTGNPYTNGWKLNPWGAVYGPEFYA 
VTGFPYPTTGTAVAYRGAHLRGRGRAVYNTFRAAPPPPPIPTYG 
AWYQDGFYGAEI\LEATQPTDTLSPLQRRQPTATVTAESTQLP


6810 


939 


55


DYSGQTPVPTEHGMTLYTPAQTHPEQPGSEASTQPIAGTQTVPQ 
TDEAAQTDSQPLHPSDPTEKQQPKRLHVSNIPFRFRDPDLRQMF
GQFGKILDVEIIFNERGSKGFGFVTFETSSDADRAREKLNGTIV
EGRKIEVNNATARVMTNKKTGNPYTNGMKLNPWGAVYGPEFYA 
VTGFPYPTTGTAVAYRGAHLRGRGRAVYNTFRAAPPPPPIPTYG
AWYQDGFYGAEI\LEATQPTDTLSPLQRRQPTATVTAESTQLP
'J k' i' I I V riUFRRPTALEPCETFHRFLLGP 


6811 


1522 


658 


DLVTVWSFVDCRVIASTHGH\KSWVSWAFDPYTTSVEEGDPME 
FSGSDEDFQDLLHFGRDRADSTQCRLSRRNSTDSRPVSVTYRFG 
SVGQDTQLCLWDLTEDILFPHQPLSRARTHTNVMNATSPPAGSN 
GNSVTTPGNSVPPPLPRSNSLPHSAVSNAGSKSSVMDGAIASGV 
SKFATLSLHDRKERHHEKDHKRNHSMGHISSKSSDKLNLVTKTK 
TDPAKTLGTPLCPRMEDVPLLEPLICKKIAHERLTVLIFLEDCI 
VTACQEGFICTWGRPGKWSFNP


6812 


4001 


1682 


kU/'LVi-'SLDLSTIIQGTWFLNGEELKSNEPEGQVEPGALRYRIEQ 
KGLQHRLILHAVKHQD3GALVGFSCPGVQDSAALTIQESPVHIL 
SPQDKVSLTFTTSERWLTCELSRVDFPATWYKDGQKVEESELL 
WKMDGRKHRL1LPEAKVQDSGEFECRTEGVSAFFGVTVQDPPV 
HIVDPREHVFVHAITSECVMLACEV\DR\EDAPVRWYKDGQEVE 
ESDFWLENEGPHRRLVLPATQPSDGGEFQCVAGDECAYFTVTI 
TDVSSWIVYPSGKVYVAAVRLERWLTCELCRPWAEVRWTKDGE 
EWESPALLLQKEDTVRRLVLPAVQLEDSGEYLCEIDDESASFT
VTVTEPPVRIIYPRDEVTLIAVTLECWLMCELSREDAPVRWYK
DGLEVEESEALVLERDGPRCRLVLPAAQPEDGGEFVCDAGDDSA 
FFTVTVTEPPVQFLALETTPSPLCVAPGEPWLSCELSRAGAPV
VWSHNGRPVQEGEGLELHAEGPRRVLCIQAAGPAHAGLYTCQSG 
AAPGAPSLSFTVQVAEPPVRVVAPEAAQTRVRSTPGGDLELWH 
LSGPGGPVRWYKDGERLASQGRVQLEQAGARQVLRVQGARSGDA 
GEYLCDAPQDSRIFLVSVEEPLLVKLVSDLTPLTVHEGDDATFR 
CEVSPPDADVTWLRNGAWTPGPQRQSCCSYGGCRMCGQRKART 
CVSKWRQAEWVQRGPCAGCEVGSPCPTTLACPWPRMGTSTASSS 
MVSYWPTRAPTAARATTIAPWPGSA 



SSTQQRPGVPAGPRPLDGYLGVADHKPLKMHCRDCALVTSSGHL 
LHSRQGSQIDQTECVIRMNDAPTRGYGRDVGNRTSLRVIAHSSI 
QRILRNRHDLLNVSQGTVFIFWGPSSYMRRDGKGQVYNNLHLLS 
QVLPRLKAFMITRHKMLQFDELFKQETGQ\NRKISNTWLSTGWF 
TMTIALELCDP.INVYGMGPPDFCRDPNHPSVPYHYYEPFGPDEC 
TMYLSHERGRKGSHHRFITEKRVFKNWARTFNIHFFQPDWKPES 
LA1NHPENKPVF 



" KFRRQEAN/ARERNRMHGLNDALDNLRKWPl iT" I ( 1 I : ~~ 
LRLAKNYIWALSEILRIGKRPDLLTFVQNLCKGLSQPTTIJLVAG 
CLQLNARSFLMGQGGEAAHHTRSPYSTFYPPYHSPELTTPPGHG 
TLDNSKSMKPYKYCSAYESFYESTSPECASPQFEGPLSPPPINY
NGIFSLKQEETLDYGKNYNYGMHYCAVPPRGPLGQGAMFRLPTD 
SHFPYDLHLRSQSLT MQ DELNAVFHN 
QGI7DPASQTKVVELLKIDGSGRRGDRRSSRDMAGGAGPRSESDLE 
DVGPTAEWNGDGSGSLRRSGSFGKLRDALRRSSEMLVKKLQGGT 
PQEPPNPRMKRASSLWFLKKSVEEPTQPGG



NLLKTHKFVLLGQDEDSLHSVPVAQMGNYOEYLKTIASPLREID 
PDQPKRLHTFGNPFKQDKKGMMIDEADEFVAGPQNKVKRPGEPN 
SPMSSKRRRSMSLLLRKPQTPPTVTNHVGGKGPPSASWFPSYPN 
LIKPTLVHTDATIIHDGHEEKMENGQITPDGFLSKSAPSELINM
TGDLMPPNQVDSLSDDFTSLSKDGLIQKPGSNAFVGGAKNCSLS 
VDDQKDPVASTLGAMPNTLQITPAMAQGINADIKHQLMKEVRKF
GRSK 



LGMMDSPKIGNGLPVIGPGTDIGISSLHMVGYLGKNFDSAKVPS 
DEYCPACKEKGKLKALKTYRISFQESIFLCEDLQCIYPLGSKSL 
NNLISPDLEECHTPHKPQKRKSLESSYKDSLLLANSKKTRNYIA 
JSIOWGEVYDETSSNLPDSSGQQNPIRTADSLERNEI 
LEADTVDMATTKDPATVDVSGTGRPSPQNEGCTSKLEMPLESKC 
TSFPQALCVQWKNAYALCWLDCILSALVHSEELKNTVTGLCSKE 

; I FWRLLTKYMQANTLLYTS QLSGVKDGDCKKLTSE I FAE 1 
CLNEVRDEIFISLQPQLRCTLGDMESPVFAFPLLLKLETHIEKL 
FLYSFSWDFECSQCGHQYQNRHMKSLVTFTNVIPEWHPLNAAHF 
GPCNNCNSKSQIRKMVLEKVSPIFMLHFVEGLPQNDLQHYAFHF 
EGCLYQITSVIQYRANNHFITWILDADGSWLECDDLKGPCSERH 
KKFEVPASEIHIVIWERKISQVTDKEAACLPLKKTNDQHALSNE 
KPVSLTSCSVGDAASAETASVTHPKDISVAPRTLSQDTAVTHGD 
HLLSGPKGLVDNILPLTLEETIQKTASVSQLNSEAFL\LENKPV 
AENTGILKTNTLLSQESLMASSVSAPCNEKLIQDQFVDISFPSQ 
WNTNMQSVQLKTEDTVNTKSVNNTDATGLIQGVKSVEIEKDAQ 
LKQFLTPKTEQLKPERVTSQVSNLKKKSTTADSQTTTSKSLQNQ 
SLKENQKKPFVGSWVKGLISRGASFMPLCVSAHNRNTITDLQPS 
VKGVNNFGGFKTKGINQKASHVSKKARKSASKPPPISKPPAGPP 
SSNGTAAHPHAHAASEVLEKSGSTSCGAQLNHSSYGNGISSANH 
EDLVEGQIHKLRLKLRKKLKAEKKKLAALMSSPQSRTVRSENLE 
QVPQDGSPMDCESrEDLLNELPYPIDIAMESACTTVPGVSLYSS 
QTHEEILAELLSPTPVSTELSENGEGDFRYLGMGDSHIPPPVPS 
EFNDVSQNTHLRQDHNYCSPTKKNPCEVQPDSLTNNACVRTLNL 
ESPMKTDIFDEFFSSSALNALANDTLDLPHFDEYLFENY
RGFDKVLWT/LSGAVK\CVQFSRISPDGEEGYPGELKVWVTYTL
DGGE/LHS/ATTEHKP/VQATPVNLT\TILTSTWQARLPQI 


6819


1 


961 


GIPCTEMGNFDNANVTGEIEFAIHYCFKTHSLEICIKACKNLAY
GEEKKKKCNPYVKTYLLPDRSSQGKRKTGVQRNTVDPTFQETLK 
YQVAPAQLVTRQLQVSVWHLGTLARRVFLGEVIIPLATWDFEDS 
TTQSFRWHPLRAKADKYEDSVPQSNGELTVRAKLVLPSRPRKLQ 
EAQEGTDQPSLHGQLCLWLGAKNLPVRPDGTLNSFVKGCLTIjP 
DQQKLRLKSPVLRKQACPQWKHSFVFSGVTPAQLRQSSLELTVW 
DQALFGMNDRLLGGT\RL3SKGDTAVGGDACSQSKLQWQKVLSS 
PNLWTDMTLVLH 


6820 


1014 


340 


GDMVYIVGHVPPGFFEKTQNKAWFREGFNEKYLKVVRKHHRVIA 

GANNPAIRVFEYDRATLSLKDMVTYFMNLSQANAQGTPRWELEY 
QLTEAYGVPDASAHSMHTVLDRIAGDQSTLQRYYVYNSVSYSAG 
VCDEACSMQHVCAMRQVDIDAYTTCLYASGTTPVPQLPLLLMAL 


6821 


1088 


518 


EFDIYR/EVGGEFVPVTRDDKKNGFPRTQHGPSPTVHPIQSPQN 
RFCVLTLDPETLPAIATTLIDVLFYSHSTPKEAASSSPEPSSIT 
FFAFSLIEGYI\SIVMDAETQKKFPSDLLLTSSSGELWRMVRIG 
GQPLGFDECGIVAQIAGPLAAADISAYYISTFNFDHALVPEDGI
GSVIEVLQRRQEGLAS 


6822 




518 


RFCVLTLDPETLPAIATTLIDVLFYSHSTPKEAASSSPEPSSIT 
FFAFSLIEGYI\SIVMDAETQKKFPSDLLLTSSSGELWRMVRIG 
GQPLGFDECGIVAQIAGPLAAADISAYYISTFNFDHALVPEDGI
GSVIEVLQRRQEGLAS


6823


654 


221 


PPKLLSRWARMGHGDEIV\LSDLNFPGLLHLPVVGPWRSVQTAC 
GIPQLLEAVLKLLPLDTYVESPAAVMELVPSDKERGLQTPVWTE 
YESILRRAGCVRALAKIERFEFYERAKKAFAWATGETALYGKL 
ILRKGVLALNPLL 








ALPKLGICAGLQWLGLPFLLENPSGYLSRSFDLGRQFLFHWTV 
NWRFLPEALFLHRAFHLALLTAHLTLLLLFALCRWHRTGESILS 
LLRDPSKRKVPPQPLTPNQIVSTLFTSNFIGICFSRSLHYQFYV 
WYFHTLPYLLWAMPARWLTHLLRLLVLGIjIELSWNTYPSTSCSS 
AALHICHAVILLQLWLGPQPFPKSTQHSKKAH 


6825 


3 


1173 


SSGEFGLQASDIMWTISDTGWILIILCSLMEPWALGACTFVHLL 
PKFDPLVILKTLSSYPIKSMMGAPIVYRMLLQQDLSSYKFPHLQ 
NCLAGGESLLPETLENWRAQTGLDIREFYGQTETGLTCMVSKTM 
KIKPGYMGTAASCYDVQIIDDKGNVLPPGTEGDIGIRVKPIRPI 
GIFSGYVDNPDKTAANIRGDFWLLGDRGIKDEDGYFQFMGRADD 
IINSSGYRIGPSEVENALMEHPAWETAVISSPDPVRGEWKAF 
VILALQFLSHDPEQLTKELQQHVKSVTAPYKYPRKIEFVLNLPK 
TVTGKIQRA\KLRDKEWKMSGKAPCAVRHLRDIHLDSPLLSLSF
PFGPLALPMDGYGDSLWEEHEYKFCLALVISTKLYHVRC 


6826 


2304


954 


LKTESFXPW/WIALAFHLLGERASPNSFWQPYIQTLPREYDTP 
LYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHA 
NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW
DMCNHTNGMTTGYNLEDDRCECVALQDFRAGBQIYIFYGTRSN 
AEFVIESGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGI 
PTSSVFALHFTEPPISAQLLAFLRVFCMT3EELKEHLL3DSAID 
RIFTLGHSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKS 
VLKNHDLSVRAKMAIKLRLGEKEILEKAVKSAAVNREYYRQQME 
EKAPLPKYEESNLGLLESSVGDSRLPLVLRNLEEEAGVQDALNI 
REAISKAKATENGLVNGENSIPNGTRSENESLNQESKRAVEDAK
GS3SDSTAGVKE


6827 


1 


779 


SSWEFGLSVLGGLFLLFVLENMLGLLRHRGLRPRCCRRKRRNL 
ETRNLDPENGSGMALQPLQAAPEPGAQGQREKNSQHPPALAPPG 
HQGHSHGHQGGTDITWMVLLGDGLHNLTDGLAIGAAFSDGFSSG 
LSTTLAVFCHKT.PHKr.inFAMTJ^R^T.SFRRLLLLSLVSGALGL 
GGAVLGVGLSLGPVPLTPWVFGVTAGVFLYVALVDMLPALFPSS 
GAPAYA\HVLLQGLGLLLGGCLMLAITLLEERLLPVTTEG 


6828 


3 


1654 


KSQHG/WILQLMHSCKEGYVKDIjKGNPGLHRAMLDLDNGTRPSE 
LGHLSQTASLKRGSSFQSGRDDTWRYKTPHRVAFVEKLTKLVLS 
QLPNFWKLWISYVNGSLFSETAEKSGQIERSKNVRQRQKDFKKM 
IQEVMHSLVKLTRGALLPLSIRDGEAKQYGGWEVKCELSGQWLA 
HAIQTVRLTHESLTALEIPNDLLQTIQDLILDLRVRCVI/1ATLQH 
TAEEIKRIAEKEDWIVDNEGLTSLPGQFEQCIVGSLQSLKGVLE 
CKPGEASVFQQPKTQEEVCQLSINIMQVFIYCLEQLSTKPDADI 
DTTHLSVDVSSPDLFGSIHEDFSLTSEQRLLIVLSKCCYLERHT 
FLNIAEHFEKHNFQGIEKITQVSMASLKELDQRLFENYIELKAD 
PIVGSLEPGIYAGYFDWKDCLPPTGVRNYLKEALVNIIAVHAEV
FTISKELVPRVLSKVIEAVSEELSRLMQCVSSFSKNGALQARLE 
ICALRDTVAVYLTPESKSSFKQALEALPQLSSGADKKLLEELLN
KFKSSMHLQLTCFQAASSTMMKT 


6829 


1 


782 


MRM2AGEAAPPAGAGGRAAGGWGKWVRLNVGCTVFLTTRQTLCR 
EQKSFLSRLCQGEELQSDRDETGAYLIERDPTYFGPILNFLRHG 
KLVLDKDMAEEGVLEEAEFYNIGPLIRIIKDRM3EKDYTVTQVP
PKHVYRVLQCQEEELTQMVSTMSDGWRFEQLVIU iS£ YIJYGSED 
QAEFLCWSKELHSTPNGLSSESSRKTKSTEEQLEEQQQQEEHV 
EEVEVEQVQVEADAQEK 'CC 1 V" _ _Z L D riJ-jQGLGV P I 


6830 


1 




MEPGSVENLSIVYRSRPFLWNKHWDVRIDSKAWRETLTLQKQL 
RYRFPELADPDTCYGFRFCHQLDFSTSGALCVALNKAAAGSAYR 
CFKERRVTKAYLALLRGHIQESRVTISHA1GRNSTEGRAHTMCI 
EGSQGCENPKPSLTDLWLEIIGLYAGDPVSKVLLKPLTGRTHQL 
RV\HCSALGHPWGDLTYGEVSGREDRPFRMMLHAFYLRIPTDT 
ECVEVCTPDPFLPSLDACWSPHTLLQSLDQLVQALRATPDPDPE 
DRGPRPGSPSALLPGPGRPPPPPTKPPETEAQRGPCLQWLSEWT 
LEPDS 


6831 


3 


1087 


SLFFGSSTPDNKVAEQEDLETQPSPSVEKAVTVIDPEGT1PTNF 
NVAEKPADHSLSEVKLKTADEPRGTLVKSGDGQ1IVKEKSMILSN 
VEDLQQPKF ISEVSREDYGKKE 1 SGDSEEMN INSWTSADGENL 
EIQSYSLIGEKLVMEEAKTIVPPHVTDSKRVQKPAIAPPSKWNI 
SIFKEEPRSDQKQKSLLSFDWDKVPQQPKSASSNFASKNITKE 
SEKPESIILPVEESKGSLIDFSEDRIiKKEMCNPTSLKISEEETK 
LRSVSPTEKKDNLENR\SYTL\AEKKVLAEKQNSV\APLELRDS 
NEIGKTQITLGSRSTELKESKADAMPQHFYQNEDYNERPKirVG 
SEKEKDEKKKK 


6832 


1809 


412 


MGSGLISGPPQDNSGEALKEPERAQEHSLPNFAGGQHFFEYLLV 
VSLKKKRSEDDYEPIITYQFPKRENLLRGQQEEEERLLKAIPLF 
CFPDGNEWASLTEYPRETFSFVLTNVDG3RKIGYCRRLLPAGPG 
PRLPKVYCIISCIGCFGLFSKILDEVEKRHQISMAVIYPFMQGL 
REAAFPAPGKTVTLKSFIPDSGTEFISLTRPLDSHLEHVDFSSL 
LHCLSFEQILQIFASAVLERKIIFLAEGLSTLSQCIHAAAALLY 
PFSWAHTYIPWPESLLATVCCPTPFMVGVQMRFQQEVMDSPME 
EVLLWLCEGTFLMSVGDEKDILPPKLQDDILDSLGQGINELKT 
AEQINEHVSGPFVQFFVKIVGHYASYIKREANGQGHFQERSFCK
ALTSKTNRRFVKKFVKTQLFSLFIQEAEKSKNPPAGYFQQKILE 
YEEQKKQ/TETKGKNCEIRAVVNKND 


6833 


1 


1129 


PLI TI "QCGGIPGHGHSHGGHGHGHGLPKGPRVKSTRPGSSDIN 
VAPGEQGPDQEETNTLVANTSNSNGLKLDPADPENPRSGDTVEV 
0\^iC;i-ILVREPDHMELEEDRAGQLNMRGVFLHVLGDALGSVI\W
NALVFYFSWKGCSEGDFCVNPCFPDPCKAFVEIINSTHASVYEA 
□ PCWVLYLDPTLCWMVCILLYTTYPLLKESALILLQTVPKQID 
IRNLIKELRNVEGVEEVHELHVWQIAGSRIIATAHIKCEDPTSY 
MEVAKTIKDVFHNHGIHATTIQPEFASVGSKSSWPCELACRTQ 
CALKQCCGTLPQAPSGKDAEKTPAVSISCLELSNNLEKKPRRTK 
AENIPA\WIEIKN\IPNK\QPESSL 


6834


73 


1151 


AGQERPAPIWRLLWLPTPSVSRKAEPAHIPINR*GA*E*RGGLP 

CSKLVLSGARGIVGTTVQVLVEAQQPLLLLFTGVWGLNLRAGEE 
SRAL*LIEEVTQVRDAHLGNAWGCAQCLSQGQVGSALAKALLE 
AAAAVRDCKEVLTVSGDKQQAEVSVRL*VRDVCVEEAGCVEFGQ 
AHGRPGLALAKGRGGTNEVEEQVQVDGVQKLVLSAHECHELVAG 
QQDGEDQAARTRLLQAGAHSVAHGRRQGQAPCRPHQEAGVSCHE
LQQWGDAL * ARE* APQ 1 1 VLLLLED VAQLRTGKKA* DLWDVE 


6835


1 




11 i SLELl 3R 11 r H 
AYTCYRPDVGYVQGMSFIAAVLILNLDTADAFIAFSNLLNKPCQ 
MAFFRVDHGLMLTYFAAFEVFFEENLPKLFAHFKKNNLTPDIYL 
IDW1FTLYSXSLPLDLACRIWDVFCRDGEEFLFRTALGILKLFE 
DILTKMDFIHMAQFLTRLPEDLPAEELFASIATIQMQSRNKKWA 
QVLTALQKDSREMREGKSVPPTLRLQREFALGTNQSPMPRPLCC 
FRLTFGQPRRTDAL 


6836 


1 


850 


MSCGRPPPDVDGMITLKV\DNLTYRTSPDSLRRVFEKYGRVGDV 
YIPREPHTKAPRGFAFVRFHDRRDAQDAEAAMDGAELDGRELRV
QVARYGRRDLPRSRQGRRHAAGPEAA/RYGRRSRSYGRRSRSPR 
RRHRSRSRGPSCSRSRSRSRYRGSRYSRSFYSRSPYSRSRYSRS 
PYSRSRYRESRYGGSHYSSSGYSNSRYSRYHSSR3HSKSGSSTS 
SRSASTSKSSSARRSKSSSVSRSRSRSRSSSMTRSPPRVSKRKS 
KSRSRSKRPPKSPEEEGQMSS 








TDGAAVAGNPGSDYFPGGTAP/ GGPRTRRPN SGT5SSGSJCA5GP 
PNPPAQGDGTSLSPNYTLESTSGNDGKPVSGGGGRGRC-RRKRDS 
GHVS PGTFFDKYSAAPDSGGAPGVS PG2QQASGAAVGGSSAGET 
RGAPTPHEKALTSPSWCKGAELLLGDQPDLIGSLDGGAKSDSSS 
PNVGEFASDEVSTSYANEDEVSSSSDNPQALVKASRSPLVTGSP 
KLPPRGVGAGEHGPKAPPPALGLGIMSNSTSTPDSYGGGGGPGH 
PGTPGLEQVRTPTSSSGAPPPDEIHPLEILQAQIQIiQRQQFSIS 
EDQPLGLKGGKKGECAVGASGAQNGDSELGSCCSEAVKSAMSTI 
DLDSLHOAliHSAAWYMPADKALVDSADDDKTLAPWEKAKPQNPNS 
KEAHDLPANKASASQPGSHLQCLSVHCTDDVGDAKARASVPTWR 


6838


16 


499 


LTDTPPPKTHMIHHSISDYKATLRCWALGFYPMEITLTWQQDEE 
DQTRDMELVETRPAGDGTFQKWAAVWPSGEE/Q/RYMCHVQHE 
GLPEPLTLRWEQSSQPTIPIVGIVAGLVLLGAWTGAWSAVMC 
RKKNSDRVSYSEAASSDHAQGSDVSLTACKV 


6839 


1 


1195 


AAPAGGGPDPEALSAFPGRHLSGLSWPQVKRLDALLSEPIPIHG 
RGNFPTLSVQPRQIRAGGPQHPGGAG\IHVHRVRLHGSAASHVL
HPESGLGYKDLDLVFRMDLRSEASFQIiTKAWLACLLDFLPAGV 
SRAKITPLTLKEAYVQKLVKVCTDSDRWSIjISIiSNKSGKNVELK 
FVDSVRRQFEFSIDSFQIILDSLLLFGQCSSTPMSEAFHPTVTG 
ESLYGDFTEALEHLRHRVIATRSPEEIRG3GLLKYCHLLVRGFR 
PRPSTDVRALQRYMCSRFFIDFPDLVEQRRTLERYL3AHFGGAD 
AARRYACLVTLHRWNESTVCLMNHERRQTLDLIAALALQALAE 
QGPAATAALAWRPPGTDGWPATVWYYVTPVQPLLAHAYPTWLP 
CN 


6840 


4254 


2061 


ELQC-DFSVPDVPKSMAWCENSICVGFKRDYYLIRVDGKGSIKEIj 
FPTGKQLSPLVAPJ3ADGKVAVGQDDLTWLNEEGICTQKCALNW 








TD I PVAMEHQPP Y 1 1 AVLPHYVE IRTFEPRLLVQS I ELQRPRF I 
TSGGSN 1 1 YVASNHFVWRL I PVPMATQ IQQLLQDKQFELALQLA 
EMKDDSDSEKQQQIHHIKNLYAFNLFCQKRFDESMQVFAKLGTD 
PTHVMGLYPDLLPTDYRKQLQYPNPLPVLSGAELEKAHLALIDY 
LTQKRSQLVKKLNDSDHQSSTSPLMEGTPTIK3KKKLLQIIDTT 
LLKCYLHTNVALVAPLLRLENNHCHIEESEHVLKKAHKYSELII
LYEKKGLHEKALQVLVDQSKKANSPDKGHERTVQYLQHLGTENL 
HLIFSYSWVLiRDFPEDGLKIFTEDLPEVESLPRDRVLGFLIEN 
FKGLAIPYLEHIIHVWEETGSRFHNCLIQLYC3KVQGLMKEYLL 
SFPAGKTPVPAGEEEGELGEYRQKLLMFLEISSYYDPGRLICDF 
PFDGLLEERALLLGRHGKHEQALFIYVH1LKDTRMAEEYCHKHY 
DRNKDGNKDVYLSLLRMYLSPPSIHCLGPIKLELLEPKANLQAA 
LQVLELHHSKLDTTKALNLLPANTQINDIRIFLEKVLEENAQKK 
RFNQVLKNLLHAEFLRV\QEERILHQQVKCIITEEKVCMVCKKK 
IGNSAFARYPNGWVHYFCS\KEVNPADT 


6841 


• 


320S 


TPSTTGTKSWTPTSSyPSAAVTPLNSSLQPLGDYGVGSKWSKRA 
REKRDSRNMEVQVTQEMRNVSIGMGSSDEWSDVQDIIDSTPELD 
MCPETRLDRTGSSPTQGIVNKAFGINTDELYHELSTAGSEVIGD 
VDEGADLLGEFSGMGKEVGNLLLENSQLLETKNALNWKNDLIA 
QEVLRGELEAAKQAWKLENRIKELEEELKRVKSEA 
IIARREPKEEAEDVSSYLCTESDKIPMAQRRRFTRVEMARVLME 
RNQYKERLMET,QEAVRWTEMIRASREHPSVQEKKKSTIWQFFSR 
LFSSSSSFPPAKRPYPSGNIHYKSPTTAGFSQRRNHAMCPISAG 
SRPLEFFPDDDCTSSARREQKREQYRQVREHVRNDDGRLQACGW 
SLPAKYKQLSPNGGQEDTRMKNVPVPVYCRPLVEKDPTMKLWCA 
AGVNLSGWRPNEDDAGNGVKPAPGRDPLTCDREGDGEPKSAHTS 
PEKKKAKELPEMDATSSRVWILTSTLTTSKWIIDANQPGTWD 
QFTVCNAHVLCISSIPAASDSDYPPGEMFLDSDVNPEDPGADGV 
LAGITLVGCATRCNVPRSNCSSRGDTPVLDKGQGEVATIANGKV 
NPSQSTEEATEATEVPDPGPSEPETATLRPGPLTEHVFTDPAPT 
PSSGPQPGSENGPEPDSSSTRPEPEPSGDPTGAGSSAAPTMWLG 
AQNGWLYVHSAVANWKKCLHSIKLKDSVLSLVHVKGRVLVALAD 
GTLAIFHRGEDGQWDLSNYHLMDLGHPHHSIRCMAWYDRVWCG 
YKNKTOVIQPKTMQIEKSFDAHPRRESQVRQLAWIGDGVWVSIR 
LDSTLRLYHAHTHQHLQDVDIEPYVSKMLGTGKLGFSFVRITAL 
LVAG S RLWVGTGNG WI S ~ PLTET WLHRGQ \ LLG\ IjRANKTS P 
TSGEG\ARPGG\IIHVYG\DDSSDRAARSFIPYCSMAQAQLCFH 
GHRDAVKFFVSVPGNVLATLNGSVLDSPAEGPGPAAPASEVEGQ 
KLRWVLVLSGGEGYIDFRIGDGEDDETEEGAGDMSQVKPVLSKA. 


6842 




926 


RCQQLSATILTDHQYLERTPLCAILKQKAPQQYRIRAKLRSYKP 
RRLFQSVKLHCPKCHLLQEVPHEGDLDIIFQDGATKTPDVKLQN 
TSLYDSKIWTTKNQKGRKVAVHFVKNUGILPLSNECLLLIEGGT 
LSEICKLSNKFNSVIPVRSGHEDLELLDLSAPFLIQGTVHHYGC 
KQWST*RSIQNLNSLVDKTSWIPSSVAEALGIVPLQYVFVMTFT 
LDDGTGVLEAYLMDSDKFFQIPASEVLMDDDLQKSVDMIMDMFC 
PPGIKIDAYPWLECFIKSYNVTNGTDNQICYQIFDTTVAEDVI 


6843 


2 


851 


1 HE LSG^KRYECNECGKSFAYTSSLIKHRRIHTGERPYECSE 
CGRSFAENSSLIKHLRVHTGERPYECVECGKSFRRSSSLLQHQR
VHTRERPYECSECGKSFSLRSNLIHHQRVHTGERHECGQCGKSF
SRKSSLriHLRVHTGERPYECSDCGKSFAEWSSLIKHLRVHTGE 
RPYECIDCGKSFRHSSSFRRHQ2VHTGMRPYK*SKFWKFSCPGF 
LLLQGQRVHTGSRCYECDKWGIFFS*NASFFT*KSAPTEEVPFE 
CNECEKAFSPLSLVTTIFT 


6844


244


642 


EHQLAGFE^RKTQTSMSLGTTREKTDRVKSTAYLSPQELEDVFY 
QYDVKSEIYSFGIVLWEIATGDIPFQGCNSEKIRKLVAVKRQQE 
PLGEDCPSELREIIDECRAHDPSVRPSVDEILKKLSTFSK*CIK 
I 

VAVRDECYWRHVFWDQDLWMLLFILMCHPETARARLEYRIRTLD 
GALENAQNLGYQGAKFAWESADSGLEVCPEDIYGVQEVHVNGAV 
GLAFELYYHTTQDLQLFREAGGWDWRAVAEFWCSRVEWSPREE 
KYHLRGVMS PDEYHSGVNNSVYTOTLVQNSLRFAAALAQDLGIiP 
IPSQWLAVADKIKVPFDVEQNFHPEFDGYEPGEWKQADWLLG 
YPVPFSLSPDVRRKNLEIYEAVTSPQGPAMTWSMFAVGWMELKD 
AVRARGLLDRSFANMAEPFKVWTENADGSGAVNFLTGMGGFLQA 
WFGCTGFRVTRAGVTFDPVCLSGISRVSVSGIFYQGNKLNFSE 
SEDSVTVEVTARAGPWAPHLEAELWPSQSRLS1LPGHKVSFPRS 
AGRIQMSPPKLPGSSSSEFPGRTFSDVRDPLQSPLWVTLGSE 
TESLTVDPASE*SGTGASETSLGPSLWPRLHP?LLGTLLACHPS 
PAARLSGKVHAAWPEFKAFCL 



LYFLKTIK*LNRLAEHP*YENEKLTKLRNTIMEQYTI 
1 1 FTKTRQSAYALSQW I TENEKFAE VGVKAHHL IGAGHS SEFKP 
MTQNEQKEVISKFRTGKINLLIATTVAEEGLDtKECNIVIRYGL 
VTNEIAMVQARGRARADESTYVLVAHSGSGVIEHETVNDFREKM 
MYKAIHCVQNMKPEEYAHKILELQMQSIMEKKMKTKRNIAKHYK 
NNPSLITFLCKNCSVLACSGEDIHVIEKKHHVNMTPEFKELYIV 
RENKTLQKKCADYQINGEriCKCGQAWGTMMVHKGLDLPCT.KIR 
NFVWFKNNSTKKQYKKWVELPITFPNLDYSECCIjFSDED 



SMCWNSDRLEMPLIDLALILYPPSYVPYTr™ ST1D 3L 3RKYJLT 
WFEDALNGVL*RAEAIQPHCVNAGDRMEKFRQKYWNKLQTLRQQ 
PFAYGTLTVRSLLDTREHCLNEFNFPDPYSKVKQRENGVALRCF 
PGWRSLDALGWEERQLALVKGLLAGNVFDWGAKAVSAVLESDP 
YFGFEEAKRKLQERPWLVDSYSEWLQRLKGPPHKCALIFADNSG 
IDIILGVFPFVRELLLRGTEVILACNSGPALNDVTHSESLIVAE 
RIAGMDPWHSALREERLLLVQTGSSSPCLDLSRLDKGLAALVR 
ERGADLWIEGMGRAVHTNYHAALRCESLKLAVIKNAWLAERLG 
GRLFSVIFKYEVPAE



AMWWNSLDGIRNIVLSNPKKRKTLSLAMLKSLQSDILHDADSND 
LKVI IISAEGPVFSSGHDLKELTEEQGRDYHAEVFQTCSKVMMH 
IRNHPVPVIAMVNGLATAAGCQLVASCDIAVASDKSSFATPGVN 
VGLFCSTPGVALARAVPRKVALEMLFTGEPISAQEALLHGLLKfK 
WPEAELQEETMRrARKIASLSRPWSLGKATFYKQLPQDLGTA 
YYLTSQAMVDNLALRDGQEGITAFLQKRKPVWSHEPV*VEH 



SLGVDGSCLEQGSPAPRPQTDTSP*PVGNWATQQEDLYHQSYEC 

VCVLFASVPDFKEFYSESNINHEGLECLRLLNEIIADFDELLSK 

PKFSGVEKIKTIGSTYMAATGLNATSGQDAQQDAERSCSHLGTM 

VEFAVALGSKLDVINKHSFNNFRLRVGLNHGPWAGVIGAQKPQ 

YDIWGKTVKVASRMESTGVLGKIQVTEETAWALQSLGYTCYSRG 

VIKVKGKGQLCTYFLNTDLTRTGPPSATLG 

ARGLNHEWTFEKIjRQHISRNAQDKQELHLFMLSGVPDAVFDLTD 

LDVLKLELIPEAKIPAKISQMTNLQELHLCHCPAKVEQTAFSFL 

RDHLRCLHVKFTDVAEIPAWVYLLKNLRELYLIGNLNSENNKMI 

GLESLRELRHLKILHVKENLTKVPSNITDVAPHLTKLVIHNDGT 

KLLVLNSLKKMMNVAELELQNCELERIPHAIFSLSNLQELDLKS 

NNIRTIEEIISFQHLKRLTCLKLWHNKIVTIPPSITHVKKLESL 

YFSNNKLESLPVAVFSLQKLRCLDVS YIvfNI SM I P IE IGLLQIILQ 

HLHITGNKVDILPKQLFKCIKLRTLNLGQNCITSLPEKVGQLSQ 

LTQLELKGNCLDRLPAQLGQCRMLKKSGLWEDHLFDTLPLEVX 

EALNQDINIPFAMGI 

VSAQVSAREGENCLGWNLADSSQESYKSLEEABDCYPPSLLTLD 
LRDLFNQVEQGPLLSCPKAGTDLSMGRAR3VGWMAAGLMIGAGA 
CYCVYKLTIGRDDSEKLEEEGEEEWDDDQELDEEEPDIWFDFET 








MARPWTEDGDWTEPGAPGGTEDRPSGGGKAWRAHPIKQRPFPYE 
HKNTWSAQNCKNGSCVLDLSKCLFIQGKLLFAEPKDAGFPFSQD 
INSHLASLSMARNTSPTPDPTVREALCAPDNLNASIESQGQIKM 
YINEVCRETVSRCCNSFLQQAGLNLLISMTVINNMLAKSASDLK 
FPLISEGSGCAKVQVLKPLMGLSEKPVLAGELVGAQMLFSFMSL 
FIRKTGNREILLETPAP


6852


1 


407 


RTRGEETYANFIKHNDGKNIFYAARTPATLFAVMFAMYIISGLT 
GFIGLNSIAVLCNLVMGLALIFLCTWAYVKYSGEFREIGTVIDQ 
IAETLWEQVLKPLGDNLM3ENIRQSVTNSIKAGLTDQVSHHAHL 
KTD 








KCDILKALGIEVDVEDGSVSLQVPVSNEIFNSASSHEEDNRSET 
ASSGYASVQGTYEPPLEEHVQSTNESLQLVNHEANSVAVDVIPH
VDWPTFEEDETPNQETAVREIKS 


6854 




585 


HESYIGTFDPGEL M l< 'WI.QDNSASYFLNRKLVYEPSTQAK " 

IICVEGFKEHCEEFWHTIRYPNWKHISCKHAESVETEGNGEDLR 
LFHSFEELLLEAHGDYGLRNDYHMNLGQFLEFLKKHKSEHVFQI
LFGIESKSSDS 


6855


1913 


1148 


GRVGGRVGRICSPLSGANEYIASTDTLKTEEVLLFTDQTDDLAK 

YIPEGCIQRWAAEMWAtDAIiHREGIVCRDLNPNNILLNDRGHI 
QLTYFSRWSEVEDSCDSDAIERMYCAPEVGAITEETEACDWWSL 
GAVLFELLTGKTLVECHPAGINTHTTLNMPEWVSEEARSLIQQL 
LQFNPLERLGAGVAGVEDIK5HPFFTPVDWAELMR 


6856 


1617 


997 


VTQLYVSVDASTKDSLKKIDRPLFKDFWQQFLDSLKALAVKQQR 

SSLTMAHVPWHEEWQFVRELVDLIPEYEIACEHEHSKCLLIAH 
RKFKIGGEWWTWINYKRFQELIQEYED5GGSKTFSAKDYMARTP 
HWALFGASERGFDPKDTRHQRKNKSKAISGC 


6857


1 


617 


KGPEATAMVCVCSHPNCRQNHIKPSHSAAQTWCGSPTPASAPNH " 

KLMAMEQGKTLPSATEDAKEEGLEAQISRLAELIGRLESKALWF 

DLQQRLSDEDGTNMHLQLVRQEMAVCPEQLSEFLDSLRQYLRGT 

TGVRNCFHITAVRLSDGFTFVIYEFWETEEAWKRHLQSPLCXAF 

RHVKVDTLSQPEALSRILVPAAWCTVGRD 








LRCVQTAKLILEELKLEKKIKIRVEPGIFEWTKWEAGKTTPTLM 
SLEELKEANFNIDTDYRPAFPLSALMPAESYQEYMDRCTASMVQ 
IVNTCPQDTGVILIVSHGSTLDSCTRPLLGLPPRECGDFAQLVR 
KIPSLGMCFCEENKEEGKWELVNPPVKTLTHGANAAFNWRNWIS 
GN 


6859 


1 


1150 


GETMFKKAKTKAKKKPRKRSDSSGGYNLSDIIOSPSSTGLLKSG 
KTNSVESLPELLTSDSEGSYAGVGSPRDLQSPDFTTGFHSDK1E 
AKVKPYVNGTSPVYSREDLKPWEKSPILKISAPQPIPSNRIDTT 
SSASWVAGSFSPVSPPWDLRTIMEIEESRQKCGATPKSHLGKT 
VSHGVKLSQKQRKMIALTTKENNSGMNSMETVLFTPSKAPKPVN 
AWASSLHSVSSKSFRDFLLEEKKSVTSHSSGDHVKKVSFKGIEN 
SQAPKIVRCSTHGTPGPEGNHISDLPLLDSPNPWLSSSVTAPSM 
VAPVTFASIVEEELQQEAALIRSREKPLALIQIEEHAIQDLLVF 
YEAFGNPEEFVIVERTPQGPLAVPMWNKHGC 






1515 


DKDKKRQKKRGIFPKVATNIMRAWLFQHLTHPYPSEEQKKQLAQ 
DTGLTILQVNNWFINARRIIVQPMIDQSNRAVSQGAAYSPEGQP
MGSFVLDGQQHMGIRPAGPMSGMGMNMGMDGQWHYM 


6861 


1889 


1515 


DKDKKRQKKRGIFPKVATNIMRAWLFQHLTHPYPSEEQKKQLAQ
DTGLTILQVNNWFINARRIIVQPMIDQSNRAVSQGAAYSPEGQP








MGSFVLDGQQHMGIRPAGPMSGMGMNMGMDGQWHYM 


6862 


2 


471 


EEIDREFHNKLKLKEDKLEKQEKPVNGEDKGDSGVDTQNSEGNA - 
DEEDPLGPNCYYDKTKSFFDNISCDDNRERRPTWAEERRLNAET 
FGIPLRPNRGRGGYRGRGGLGFRGGRGRGGGRGGTFTAPRGFRG 
GFRGGRGGREFADFEYRKTTAFGP 


6863 


2216 


487 


PQEPALKSEFSQVASNTIPLPLPQPNTCKDNGPCKQVCSTVGGS 
AICSCFPGYAIMADGVSCEDQDECLMGAHDCSRRQFCVNTLGSF 
YCVNHTVLCADGYILNAHRKCVDIN3CVTDLHTCSRGEHCVNTL 
GSFHCYKALTCEPGYALKDGECEDVDECAMGTHTCQPGFLCQNT 
KGSFYCQARQRCMDGFLQDPEGNCVDINECTSLSEPCRPGFSCI 
NTVGSYTCQRNPLICARGYHASDDGTKCVDVNECETGVHRCGEG 
QVCHNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRLCQHT 
CENTLGSYRCSCASGFLLAADGKRCEDVNECEAQRCSQECANIY 
GSYQCYCRQGYQLAEDGHTCTDIDECAQGAGILCTFRCLNVPGS 
YQCACPEQGYTMTANGRSCKDVDECALGTHNCSEAETCHNIQGS 
FRCLRFECPPNYVQVSKTKCERTTCHDFLECQNSPARITHYQLN
FQTGLLVPAHI FRIGPAPAFTGD'J 1/ [jNIII 3NE ^YFGTRRLN 
AYTGWYLQRAVLEPRDFALDVEMKLWI !G " r 'TE LAKMHIFFT 
TFAL 


6864


2 


2933 


LADSSPSNLQIIIKELLSMHHQPDPALTKEFDYLPPVDSRSSSG 
FVGLRNGGATCYMNAVFQQLYMQPGIaPESLLSVDDDTDNPDDSV 
FYQVQSLFGHLM2SKLQYYVPENFWKIFKMWNKELYVREQQDAY 
EFFTSLIDQMDEYLKKMGRDQIFKNTFQGIYSDQKICKDCPHRY 
EREEAFMALNLGVTSCQSLEISLDQFVRGEVLEGSNAYYCEKCK 
EKR I TVKRTC 1 KSLPS VLVIHLMRFGFDWESGRS I KYDEQ1 RFP 
WMLNMEPYTVSGMARQDSSSEVGEWGRSVDQGGGGSPRKKVALT 
ENYELVGVIVHSGQAHAGHYY5FIKDRRGCGKGKWYKFNDTVIE 
EFDLNDETLEYECFGGEYRPKVYDQTNPYTDVRRRYWNAYMLFY 
QRVSDQNSPVLPKKSRVSVVRQEAEDLSLSAPSSPEISPQSSPR 
PHRPNNDRLSILTKLVKKGEKKGLFVEKMPARIYQMVRDENLKF 
MKNRDVYSSDYFSFVLSLASLWATKLKHPYYPCMAKVSLOLAIQ 
FLFQTYLRTKKKLRVDTEEWIATIEALLSKSFDACQWLVEYFIS 
SEGRELIKIFLLECNVREVRVAVATILEKTLDSATjFYQDKLKSL 
HQLLEVLLALLDKDVPENCKNCAQYFFLFNTFVQKQGIRAGDLL 
LRHSALRHMISFLLGASRQNNQIRRWSSAQAREFGNLHNTVALL 
VLHSDVSSQRNVAPGIFKQRPPISIAPSSPLLPLHEEVEALLFM
SEGKPYLLEVMFALRELTGSLLALIEMWYCCFCNEHFSFTMLH 
FIKNQLETAPPHELKOTFQLLHEILVIEDPIQVERVKFVFETEN 
GLLALMHHSNHVDSSRCYQCVKFLVTLAQKCPAAKEYFKENSHH 
WSWAVQWLQKKMSEHYWTLQSNVSNETSTGKTFQRTISAQU'l'liA 
YATALLNEKEQSGSSNGSESSPANENGDRHLQQGSESPMMIGEL 
RSDLDDVDP 




1820 


1242 


DPERWKHLSKVTPPGSSVSTTPVQWRLQSPQSQGSMMPSCNRS 
CSCSRGPSVEDGKWYGVRSYLHLFYEGYAVPPKLEGIGEGEFLV 
LDQRAADYNQALGTCRLAGTALCVAAGVLLAICLFWAMIGWLSQ 
DTKAEPLDPEADSHVEVFGDEPEQQLSPIFRNASGQSWFSPPAS 
PFGQSSVQTIQPKRDS 


6866


1571 


495 


DCPRPRYTLYGLRATCMRDLDWAWINAVSAFKALEQDLPVNIKF 
IIEGMEEAGSVALEELVEKEKDRFFSGVDYIVISDNLWISQRKP 
AITYGTRGNSYFMVEVKCRDQDFHSGTFGGILHEPMADLVALLG 
SLVDSSGHILVPGIYDEWPLTEEEINTYKAIHLDLEEYRNSSR 
VEKFLFDTKEEILMHLWRYPSLSIHGIEGAFDEPGTKTVIPGRV 
IGKFSIRLVPHMNVSAVEKQVTRHLEDVFSKRNSSWKMWSMTL
C-LHPWIANIDDTQYLAAKRAIRTVFGTEPDMIRDGSTIPIAKMF 
QEIVHKSWLIPLGAVDDGEHSQNEKINRWNYIEGTKLFAAFFL 
EMAQLH 


6867


2833 


1704 


GTRIMSQPKQKELAGFVRQKMLLDYSVYMGRCVPQESRSPQRSP 
LQSAESSPTAGKKLPEVPPSEEEEQEAWVNALLGRIFWDFL3EK 
YWSDLVSKKIQMKLSKIKLPYFMNELTLTELDMGVAVPKILQAF
KPYVDHQGLW1DLEMSYNGSFLMTLETKMNLTKLGKEPLVEALK 
VGEIGKEGCRPRAFCLADSDEESSSAGSSEEDDAPEPSGGDKQL 
LPGAEGYVGGHRTSKIMRFVDKITKSKYFQKATETEFIKKKIEE t 
VSNTPLLLTVEVQECRGTLAVNIPPPPTDRVWYGFRKPPHVELK
ARPKLGEREVTLVHVTDWIEKKLEQEFQKVFVMPNMDDVYITIM 
HSAMDPRSTSCLLKDPPVEAADQP 






346 


RPTRPPTRPEEIKNLILPYISDMNFVQDLCEDFYELFKTDKGFD 
GSQGRIVHLSNSFTQTVNCRKPFFSSW 


6869


3 


1619 


MYMERMDKRALISFWESVEHLKNANKNEIPQLVGEIYQNFFVES 
KEISVEKSLYKEIQQCLVGNKGIEVFYKIQEDVYETLKDRYYPS 
FIVSDLYEKLLIKEEEKHASQMISNKDEMGPRDEAGEEAVDDGT 
NQINEQASFAVWKLRELNEKLEYKRQALNSIQNAPKPDKKIVSK 
I'll ILIEKERTDLQLHMARTDWWCENLGMWKASITSGEVTEE 
NGEQLPCYFVMVSLQEVGGVETKNWTVPKRLSEFHNLHRKLSEC 
VPSLKKDQLPSLSKLPFKSIDHTFMEKFENQLNKFLQNLLSDER 
LCQSEALYAFLSPSPDYLKVIDVQGKKNSFSLSSFLERLPRDFF 
SHQEEETEEDSDLSDYGDDVDGRKDAIjAEPCFMLIGEIFELRGM 
FKWVRRTLIALVQVTFGRTINKQIRDTVSWIFSEQMLVYYINIF 
RDAFWPNGKLAPPTTIRSKEQSQETKQRAQQKLLENIPDMLQSL 
VGQQNARHGI IKI FNALQETRANKH 1 ,1 ,YA1 ,ME LLLIELCPELRV 
HLDQLKAGQV 


6870 




1566 


MAAWAATRWWQLLLVLSAAGMGASGA?QPPNILI,I,L,M5DMGWG
DLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLT
GRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG
YVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP

ARHHPFFLYWAVDATHAPVYASKPFLGTSQRGRYGDAYREIDDS
IGKILELLQDLHVADNTFVFFT5DNGAALISAPEQGGSNGPFLC 
GKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLAL 

QHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTTH1ILEDHTKLPL 
IFHLGRDPGERFPLSFASAEYQEALSRIT3WQQHQEALVPAQP 
QLNVCNWAVMNWAPPGCEKLGKCLTPPESIPKKCLWSH


6871 


209 


1126 


RMSLNPPIFLKRSEENSSKFVETKQSQTTSIASEDPLQNLCLAS 
QEVLQKAQQSGRSKCLKCGGSRMFYCYTCYVPVENVPIEQIPLV
KLPLKIDIIKHPNETDGKSTAIHAKLLAPEFVNIYTYPCIPEYE 
EKDHEVALIFPGPQSISIKDISFHLQKRIQNNVRGKNDDPDKPS 
FKRKRTEEQEFCDLNDSKCKGTTLKKI^FIDSTWNQTNKIFTDE 
RLQGLLQVELKTRKTCFWRHQKGKPDTFLSTIEAIYYFLVDYHT 
DILKEKYRGQYDNLLFFYSFMYQLIKNAKCSGDKETGKLTH 


6872 


880 


459 


FC-LLMWLS LI FMKGNCVREDL I FNFLFKLGLDVRETNGLFGNT 
KKLITEVFVRQKYLEYRRIPYTEPAEYEFLWGPRAFLETSKMLV 
LRFLAKLHKKDPQSWPFHYLEALAECEWEDTDEDEPDTGDSAHG 
PTSRPPPR 


6873 


1929 


955 


DEQAVLCSKDKTYDLKIADTSNMLLFIPGCKTPDQLKKEDSHCN 
IIHTEIFGFSNNYWELRRRRPKLKKLKKLLMENPYEGPDSQKEK 
DSNSSKYTTEDLLDQIQASEEEIMTQLQVLNACKIGGYWRILEF 
DYEMKLLNHVTQLVDSESWSFGKVPLKTCLQELGPLEPEEMIEH 
CLKCYGKKYVDEGEVYFELDADKICRAAARMLLQlvAVKFNLAEF 
QEVWQQSVPEGMVTSLDQLKGLALVDRHSRPEIIFLLKVDDLPE 
DNQERFNSLFSLREKWTEEDIAPYIQDLCGEKQTIGALLTKYSH 
SSMQNGVKVYNSRRPIS | 






WO 01/53312 



PCT/US00/34263





6874 


1 


307 


DSIADHVNSAAVNVEEGTKNLGKAAKYKLAALPVAGALIGGMVG 
GPIGLLAGFKVAGIAAALGGGVLGFTGGKLIQRKKQKMMEKLTS 
SCPDLPSQTDKKCS 


6875 


1688 


349 


VIGTGERGNSASEKWEIMFNEELGDPFIIIHSISLLNAEEHSIA 
TLLLRIEKEELDMKGSGFYVSLEWVTISKKNQDNKKYEIIKREI 
LRGKSVPHYAAIEPDGNGLMIVSYKSLTFVQAGQDLEENMDEDI 
SEKIKEPLYYWQQTEDDLTVTIRLPEDNTKEDIQIQFIiPDHINI 
VLKDHQFLEGKLYSSIDHESSTWIIKESNSLEISLIKKNEGLTW 
PELVIGDKQGELIRDSAQCAAIAERLMHLTSEELNPNPDKEKPP 
CNAQELEECDIFFEESSSLCRFDGNTLKTTHVVNLGSNQYLiFSV 
IVDPKEMPCFCLRHDVDA1LWQPHSSKQDDMWEHIATFNALGYV 
QASKRDKKFFACAPNYSYAALCECLRRVFIYRQPAPMSTVLYNR 
KEGRQVGQVAKQQVASLETNDPILGFQATNERLFVLTTKKLFLI 






1235 


VGEMTLIWRHLLRPLCLVTSAPRILEMHPFLSl.G'l'SHTSVTKLS 
LHTKPRMPPCDFMPERYQVIFLWSGSEAKE" MT 1 1A£ ' N~ 
DIISFRGAYHGCSPYTLGLTNVGIYKMELPGGTGCQPTMCPDVF
RGPWGGSHCRDSPVQTIRKCSCAPDCCQAKDQYIEQFKDTLSTS 
VAKSIAGFFAEPIQGYNGWQYPKGFLKEAFELVRARGGVCIAN 
3VQTGFGRLGSHFWGFQT]JDVLPDIVTMAKGIGNGFP.MAAVITT 
PEIAKSLAKCLQHFNTFGGNPMACAIGSAVLEVIKEENLQENSQ 
1 l r T f ^iF^EFEIVGDVRGKGLMIGIEMVQDKISCRPL 
PREEVNQIHEDCKHMGLLVGRGSIFSQTFRIAPSMCITKPEVDF 
AVEVFRSALTQHMERRAK 


6877 


| * 


778 


GTSPSPARAYAPPTERKRFYQNVSITQ3EGGFEINLDHRKLKTP 
QAKLFTVPSEALAIAVATEWDSQQDTIKYYTMHLTTLCNTSLDN 
PTQRNKDQLIRAAVKFLDTDTICYRVEEPETLVELQRKEWDPII
EWAEKRYGVEISSSTSIMGPSIPAKTREVLVSHLASYNTWALQG 
IEFVAAQLKSMVLTLGLIDLRLTVEQAVLLSRLEEEYQIOKWGN 
IEWAHDYELQELRARTAAGTLFIHLCSESTTVKHKLLKE 


6878 


931 


263 


QTLQGDFKNRAEMIDFNIRIKNVTRSDAGKYRCEySAPSEQGQN 
LEEDTVTLEVLVAPAVPSCEVPSSALSGTWELRCQDKEGNPAP 
EYTWFKDGIRLLENPRLGSQSTNSSYTMNTKTGTLQFNTVSXLD 
TGEYSCEARNSVGYRRCPGKRMQVDDLNISGIIAAVWVALVIS 

II 


6879 


3 


845 


IRVIGESDIMQEFLSESDKNYNGVSDVELRVALPDGTTVTVRVK 

FPHKLYIQNYTSAVPGTCLTIRKWLFTTEEEILLNDNDLAVTYF 
FHQAVDDVKKGYIKAEEKSYQLQKLYEQRKMVMYIiNMLRTCEGY 
NEIIFPHCACDSRRKGHVITAISITHFKLHACTEEGQLENQVIA 
FEWDEMQRWDTDEEGMAFCFEYARGEKKPRWVKIFTPYFNYMHE 
CFERVFCELKWRKEEY 


6880 


2110 


1437 


RKDNCTAKEWTFPEAKWNTTARVFSHIRLGMGHVLIIVQCFISS 
MANIYNEKILKEGNQLTESIFIQNSKLYFFGILFNGLTLGLQRS 
NRDQIKNCGFFYGHRAFSVALIFVTAFQGLSVAFILKFLDNMFH 
VLMAQVTTVIITTVSVLVFDFRPSLEFFLEAPSVLLSIFIYNAS 
KPQVPEYAPRQERIRDLSGNLWERSSGDGEELERLTKPKSDESD 
EDTF 


6881 


2638 


2244 


NDSKWEDiHVITGALKMFFRELPEPLFTFNHFNDFVNAIKQEPR 
QRVAAVKDLIRQLPKPNQDTMQILFRHLRRVIENGEKNRMTYQS 
IAIVFGPTLLKPEKETGNIAVHTVYQNQIVELILLELSSIFGR 




1 


850 


GIPEAQLWIYPVKSCKGVPVSEAECTAMGLRSGNLRDRFWLVIN 
QEGKMVTARQEPRLVLISLTCDGDTLTLSAAYTKDLLLPIKTPT 
TNAVHKCRVHGLEIEGRDCGEATAQWITSFLKSQPYRLVHFEPH 
MRPRRPHQIADLFRPKDQIAYSDTSPFLILSEASLADIJSrSRLEK 








KVKATNFEPNIVISGCDVyAEDSWDELLIGDVELKRVMACSKCI 
LTTVDPDTGVMSRKEPLErLKSYRQCDPSERKLYGKSPLFGQYF 
VLENPGTIKVGDPVYLLGQ


6883


2794 


2255 


NSKLKLNQNLKLFITLTYQVLSLHGWGPGIHLQKEGAFPVTQKR 
ALQLLYDLRYLNIVLTAKGDEVKSGRSKPDSRIEKVTDHLEALI 
DPFDLDVFTPHLNSNLHRLVQRTSVLFGLVTGTENQLAPRSSTF 
NSQEPHNILPLASSQIRFGLLPLSMTSTRKAKSTRNIETKAQYD 
ANC 


6884




99 


EFERVTAEAVKPRETSEPRAAAQRFCEKFPFL 


6885




1554 


STGQFWHVTDLHLDPTYHITDDHTKVCASSKGANASNPGPFGDV 
LCDSPYQLILSAFDFIKNSGQEASFMIWTGDSPPHVPVPELSTD 
TVINVITNMTTTIQSLFPNLQVFPALGNHDYWPQDQLSWTSKV 
YNAVANLWKPWLDEEAISTLRKGGFYSQKVTTWPNLRIISLNTN 
LYYGPNIMTLNKTDPANQFEWLESTLNNSQQNKEKVYIIAHVPV 
GYLPSSQNITAMREYYNEKLIDIFQKYSDVIAGQFYGHTHRDSI 

DYKLLDMLQYY LNLTEANLKGE? IK} E in I 1DIEDLQPES 
LYGLAKQFTILDSKQFIKYYNYFFVSYDSSVTCDKTCKAFQICA
IMNLDNISYADCLKQLYIKHNY 




2 


1341 


QCGGIPGKEGGSBRPLEEGTGSSPACVRGAAPGSEDAFYPTRAK 
QARVSQELKKAAKRTVSISEGPDTLGDGMRERRETLALAPEPEP 
LEKEACEKWKRPFRSASATSLTLSHCVDWKGLLDFKKRRGHSI 
GGAPEQRYQIIPVCVAARLPTRAQDVLDAHLSEVNAVRFGPNSS 
LLATGG. 2 XiRLIHL"j_-lVV/i.- JRLEi l-.'Q'-'LIGAGGS ITSVDFDPSGY 
QVLAATYNQAAQLWKVGEAQSKETLSGHKDKVTAAKFKLTRHQA 
VTGSRDRTVKEWDLGRAYCSRTINVLSYCNDWCGDHIIISGHN 
DQKIRFWDSRGPHCTQVIPVQGRVTSLSLSHDQLHLLSCSRDNT 
LKVIDLRVSNIRQVFRADGFKCGSDWTKAVFSPDRSYALAGSCD 
GALYIWDVDTGKLESRLQGPHCAAVNAVAWCYSGSHMVSVDQGR 
KWLWQ 




1047 


116 


WTARPSQKPFWEAGAVPGDPLSTGCSQAQLGGCCPRGPWGPQHG 
GQQRAAGPTLPRGERGGPQQSGPGIAAQTPPTSKQVAWRAFLTG 
TYRSQSPRSPAGPFRGGTGWWPEPAVCLCVAVGPQRLSSPGLVY 
WASGSEHCYDIYRLYHSCADPTCCGTGPDARAWDYQACTEINLT 
FASNNVTDMFPDLPFTDELRQRYCLDTWGWPRPDWLLTSFWGG 
DLRAASNIIFSNGNLDPWAGGGIRRNLSASVIAVTIQGGAHHLD 
LRASHPEDPASWEARKLEATIIGEWVKAARREQQPALRGGPRL 
Sh 


6888 


1 


992 


FVAYVKKEIPHIWTHCLLNPHALVIKTLPTKLRDALFTWRVI 
NFIKGRAPNHRLFQAFFEEIGIEYSVLLFHTEMRWLSRGQILTH
IFEMYEEINQFLHHKSSNLVDGFENKEFKIHLAYLADLFKHLNE 
LSASMQRTGMNTVSAREKLSAFVRKFPFWQKRIEKRNFTNFPFL 
E3IIVSDNEGIFIAAEITLHLQQLSNFFHGYFSIGDLNEASKWI 
LI) ? FL FN I DFVDDS YLMKNDLAELRASGQ ILMEFETMKLEDFWC 
AQFTAFPNLAKTALEILMPFATTYLCELGFSITFTFQNKVPEAA
LILSDDIRVAISKKVPSFLGHH 


6889 


1 


1534 


LTL3NQIKEEREQDNSESPNGRTSPLVSQNNEQGSTLRD1,LTTT 
AGKLRVGSTDAGIAFAPVYSMGAPSSKSGRTMPNILDDIIASW 
ENKIPPSKTSKINVKPELKEEPEESIISAVDENNKLYSDIPHSW 
ICEaHILWLKDYKNSSNWKLFKECWKQGQPAWSGVHKKMNISL 
1 I AE IS1DFGDHQADLLNCKDSIISNAOTKEFWDGFEEVSKRQ 
KIIKSGETWLKLKDWPSGEDFKTMMPARYEDLLKSLPLPEYCNP 
EGKFKLASHLPGFFVRPDLGPRLCSAYGWAAKDHDIGTTNLHI 
EVSDWNILVYVGIAKGNGILSKAGILKKFEEEDIiDDILRXRLK 
DSSEIPGALWHIYAGKDVDKIREFLQKISKEQGLEVLPEHDPIR 
DQSWYVNKKLRQRLLEEYGVRTWTLIQFLGDAIVLPAGALHQVQ 



PLYLHRALWHKTAETCNSPPCGAKDSLIFGAITCF 
TGFLGVDTGAGATRWCRLKTQEADPLVCAVGMLGSAIFICLIFV 
AAKSSIVGAYICIFVGETLLFSNWAITADILMYWIPTRRATAV
ALQSFTSHLLGDAGSPYLIGFISDLIRQSTKDSPLWEFLSLGYA 
LMLCPFVWLGGMFFLATALFFVSDRARAEQQVNQLAMPPASVK 



LRIHQELLSKELKLLRGITIESIIHIGIAAGKEQFMQDASNVMQ
LLLKTQSHLYNMEDNNPEVRQAAAYGLGVMAQFGGDDYRSLCSE 
AVPLLVKVIKRAHSKTKKNVIATENCISAIGKILKFKPNCVNVD 
EVLPHWLSWLPLHEDKEEAIQTLSFLCDLIESNHPWIGPNNSN 
LPKI ISIIAEGKINETINYEDPCAKRLANWRQVQTSEDLWLEC 
VSQLDDEQQEALQELLNFA 



RSVAAASGPGAWGTDHYCLELLKKKDYEGYLCSLLLPAES^SSV 
FALRAFNVELAQVKDSVSEKTIGLMRMQFWKKTVEEIYCDMPPH 
QPVAIELWKAVKRHNLTKRWLMKIVDEREKNLDDKAYRNIKELE 
NYAENTQSSLLYLTLEILGIKDLHADHAASHIGKAQGIVTCLRA
TPYHGSRRKVFLPMDICMLHGVSQEDFLRRNQDKNVRDVIYDIA 
SQAHLHLKHARSFHKTVPVKAFPAFLQTVSLEDFLKKIQRVDFD 
IFHPSLQQKNTLLPLYLYIQSWRKTY 

DGERKSMSVERTFSEINKAEEQYSLCQELCSELAQDLQKERLKG 
RTVTIKLKNVNFEVKTRASTVSSWSTAEEIFAIAKELLKTEID 
ADFPHPLRLRLMGVRI S S FPNEEDRKHQQRS 1 I GFLQAGNQALS 
ATECTLEKTDKDKFVKPLEMSHKKSFFDKKRSERKWSHQDTFKC 
EAVNKQSFQTSQPFQVLKKKMNENLEI3ENSDDCQILTCPVCFR 
AQGCISLEALNKHVDECLDGPSISENFXMFSCSHVSATKVNKKE 

NVPASSLCEKQDYEAH

TTLCKPLVPREHQFYETLPAEMRKFTPQYKGKSQLLEGLPHWR3 
DVRDRGHGRPWQPSLEPSLPPTLCFPSLSSFSSSWPSAQHLTPS 
VFNPW 

VTYVELCDLASPTALLIMRTVLDLIVEDLQSTSEDKEQQYTSQT 
TRLLALLYALASHKACKLAILHLINGTIKGDERYAEIFQDLLAL 
VRSPGDSVIRQQCVEYVTSILQSLCDQDIALILPSS3EGSISEL 
EQLSNSLPNKELMTSICDCLLATLANSESSYNCLLTCVRTMMFL
AEIIDYGLFHLKSSLRKNSSALHSLLKRWSTFSKDTGELASSFL 
EFMRQILNSDTIGCCGDDNGLMEVEGAHTSRTMSINAAELKQLL 
QSKEESPENLFLELEKLVLEHSKDDDNLDSLLDSWGLKQMLES 
SGDPLPLSUQDVEPVLSAPESLQNLFMNRTAYVLADWDDQLKS 
MWFTPFQAEEIDTDLDLVKVDLIELSEKCCSDFDLHSELERSFI 
SEPSSPGRTKTTKGFKLGKHKHETFITSSGKSEYIEPAKRAHW 
PPPRGRGRGGFGQGIRPHDIFRQRKQNTSRPPSMHVDDFVAAES 
KEWPQDGIPPPKRPLKVSQKISSRGGFSGNRGGRGAFHSQNRF 
FTPPASKGNYSRREGTRGSSWSAQNTPRGNYNESRGGQSNFNRG 
PLPPLRPLSSTGYRPSPRDRASRGRGGLGPSWASANSGSGGSRG 

KFVS GG5 GRGRHVR5 FTR 

GNIVIQKKKYNKQHIIPLENVTIDSIKDEGDLRNGWliIKTPTKS 
FAVYAATATEKSEWMHHINKCVTDLLSKSGKTPSNEHAAVWVPD 
SEATVCMRCQKAKFTPVNRRHHCRKCGFWCGPCSEKRFLLPSQ 
SSKPVRICDFCYDDLSAGDMATCQPARSDSYSQSLKSPLNDMSD 
DDDDDDSSD 

GDGLMHEVWGLMERPDWETAIQKPLCSLPAGSGNALAASLNHY " 

AGYEQVTNEDLLTNCTLLLCRRLLSPMNLLSLHTASGLRLFSVL 

SLAWGFIADVDLESEKYRRLGEMRFTLGTFLRLAALRTYRGRLA 

YLPVGRVGSKTPASPVWQQGPVDAHLVPLEEPVPSHWTWPDE 

DFVLVLALLHSHLGSEMFA^MGRCaAGVMHLFYVRAGVSSAMri 








LRLFLAMEKGRHMEYECPYLVYVPWAFRLEPKDGKGVFAVDGE 
LMVSEAVQGQVHPNYFWMVSGCVEPPPSWKPQQMPPPEEPL


6898 


919 


346 


QKTVTAVASLLKGRQGIYTENERRMGAVIKIRFFKIMLVLIICW 
LSNIINESLLFYLEMQTDINGGSLKPVRTAAKTTWFIMGILNPA 
QGFLLSLAFYGWTGCSLGFQSPRKEIQWESLTTSAAEGAHPSPL 
MPHENPASGKVSQVGGQTSDEALSMLSEGSDASTIEIHTASESC 
NKNEGDPALPTHGDL 


6899 


120 


827 


MKVRKNNDAYLLDI*CNKINMDCFISCFFKKMLTTLMFSHSGILSL 
LEHGEEYTFSLPCAYARSILTVPWVELGGKVSVNCAKTGYSASI 
TFHTKPFYGGKLHRVTAEVKHNITNTWCRVQGEWNSVLEFTYS 
NGETKYVDLTKLAVTKKRVRPLEKQDPFESRRLWKNVTDSLRES 
EIDKATEHKHTLEERQRTEERHRTETGTPWKTKYFIKEGDGWVY 
HKPLWKIIPTTQPAE 


6900 


3 


451 


TEVLGSKGIHELRSSTSALHHALEESASLLTMFWRAALPSTHIP 
VLPGKVGESTERELLELRTKVSQQEQLLQSTTEHLKNANQQKES 
MEQFIVSQLTRTHDVLKKARTNLEWKLLHQSEAPSLSPTHHHP 
LADLVGDSWPALRFQEK


6901


1 


201 


CD! ir Fl TLQQQSrLEQWAASfLDNVMMQALKPYEGRP 
SFPXAARQFLLKWSFYRYHLGFS 




2 


2S7 


GA1 i'iT?SQP?ROPPQAAPSSHPHSDLTFNPSSAjEGQAGAQGA 
SDMPEPSLDLLPELTNPDELLSYLDPPDLPSNSNDDIjLSLFENN 


6903 






I ^YRQGPTGIHILVIDQ jil- F.^L 1 r < < i ] 
HULK 


6904 




2092 


MEASLP ']_, ^~ rl-1 TL' ; r NFDLLL.C 
VGNFFGSTQDAEWEEYKTGIKKAP1QTYVLGANNQETVKYFQDA 
DGCELAENITYLGRKGIFTGSSGLQIVYLSGTESLNEPVPGYSF 
SPKDVSSLRMMLCTTSQFKGVDILLTSPWPKCVGNFGNSSGEVD 
TKKCGSALVSSLATGLKPRYHFAALEKTYYERLP YRNHI 1 LQEN 
AQHATRFIALANVGNPEKKKYLYAFSIVPMKLMDAA3L.VKQPPD 
VTENPYRKSGQEASIGKQILAPVEESACQFFFDLNEKQGRKRSS 
TGRDSKSSPHPKQPRKPPQPPGPCWFCLASPEVEXHLWNIGTH 
CYLALAKGGLSDDHVLILPIGHYQSWELSAEWEEVEKYKATL 
RRFFKSRGKWCWFERNYKSHHLQLQVIPVPISCSTTDD1KDAF 
ITQAQEQQIELLEIPEIISDIKQIAQPGAAYFYVELDTGERLFHR 
I KKNFPLQFGREVLAS EAI LNVPDKSDWRQCQ 1 SKEDEETLARR 
FRKDFEPYDFTLDD 


6905




226 


VSKTGEAETITSHYLFALGVYRTLYLFNWIWRYHFEGFFDLIAI 
VAGLVQTVLYCDFFYLYITKVLKGKKLSLPA 


6906 


3 


611 


SYDDHNGHIDFITAASNLRAKMYSIEPADRFKTKRIAGKIIPAI 
ATTTATVSGLVALEMIKVTGGYPFEAYKNWFLNLAIPIWFTET 
TEVRKTKIRNGISFTIWDRWTVHGKEDFTLLDFINAVKEKYGIE 
PTMWQGVKMLYVPVMPGHAICRLKLTMHKLVKPTTEKKYVDIiTV 
SFAPDIDGDEDLPGPPVRYYFSHDTD


6907 


2 


2228 


LRGVPVWAAGAFRFSSGEESTSHLIMSRRSQRLTRYSQGDDDGS 
SSSGGSSVAGSQSTLFKDSPLRTLKRKSSNMKRLSPAPQLGPSS 
DAHTSYYSESLVHESWFPPRSSLEELHGDANWGEDLRVRRRRGT 
GGSESSRASGLVGRKATEDFLGSSSGYSSEDDYVGYSDVDQQSS 
SSRLRSAVSRAGSLLWMVATSPGRLFRLLYWWAGTTWYRLTTAA 
SLLDVFVLTRRFSSLKTFLWFLLPLLLLTCLTYGAWYFYPYGLQ 
TFHPALVSWWAAKDSRRADEGWEARDSSPHFQAEQRVMSRVHSL 
ERRLEALAAEFSSNWQKEAMRLERLELRQGAPGQGGGGGLSHED 
TLALLEGLVSRREAALKEDFRRETAARIQEELSALRAEHQQDSE 
DLFKKIVRASQESEARIQQLKSEWQSMTQESFQESSVKELRRLE 
DQLAGLQQELAALALKQSSVAEEVGLLPQQIQAVRDDVESQFPA 
WISQ FLARGGGGRVGLLQREEMQAQLRE LESKI LTH\ r AEMQGKS 











AREAAASLSLTLQKEGVIGVTL^O v n J ! i^EDRIGLA 
DYALESGGASVISTRCSETYETKTALLSLFGIPLWYHSQSPRVI 
LQ.PDVHPGNCWAFQGPQGFAVVRLSARIRPTAVTLEHVPKALSP 
NSTISSAPKDFAIFGFDEDLQQEGTLLGKFTYDQDGEPIQTFHF 
QAPTMATYQWELRILTNWGHPEYTCIYRFRVHGEPAH 


6908


3 


780 


QVPSAAWLMAVCGLGSRLGLGSRLGLQGCFGAARLLYPRFQSRG 
PQGVEDGDRPQPSSKTPRIPK1YTKTGDKGFSSTFTGERRPKDD 
QVFEAVGTTDELSSAIGFALELVTEKGHTFAEELQKIQCTLQDV 
GSALATPCSSAREAHLKYTTFKAGPILELEQWIDKYTSQLPPLT 
AFILPSGGKISSALHFCRAVCRRAERRWPLVQMGETDANVAKF 
LNRLSDYLFTLARYAAMKEGNQEKIYKKNDPSAESEGL 








GRLLAVGTDLYGQRSSAPEQELLVQDATPVSNSLLPEKAFSDIP 
SPYLRGTIKMMQAVRQAFQDQDDRRTWDGRPLTMAATFDDCLYA 
LCWDTIKRSSQTGEWQNIAIMTEEPELSPAYLI3EAMRRSRMS 
LYC 


6910 


1 


1058 


LVPVWIDSYYYGKLVIAPLNIVLYNIFTPHGPDLYGTEPWYFY 
LINGFLNFNVAFALALLVLPLTSLMEYLLQRFHVQNLGHPYWLT 
IAPMYIWFIIFFIQPHKEERFLFPVYPLICLCGAVALSALQHSF 
LYFQKCYHFVFQRYRLEIIYTVTSNWLALGTVFLFGLLSFSRSVA 

PSSFLLPDNWQLQFIPSEFRGQLPKPFAEGPIATRIVPTDMSTDQ 
NLEEPSRYIDISKCHYLVDLDTMRETPREPKYSSNKEEWISLAY 
RPFLnASRSSKLLRAFYVPFLSDQYTVYVFYTILKPRKAKQIRK 
KSGG 




1184 




PEAAEPGQICCDKPVLRDMNPWSTAIVAF 


6912 




844 


AMKPVETHSFQMLFTILSTGSALKAQSYEDAYRCIKSSILLGSI 

GESGELVCTKPIPCQPTHFWNTjENGNKYRKAYFSKFPGIWAHGD 
YCRINPKTGGIVMLGRSDGTLNPNGVRFGSSEIYNIVESFEEVE 

SARHVPSLILETKGIPYTLNGKKVEVAVKQIIAGKAVEQGGAFS 
NPETLDLYRDIPELQGF










6914 


1251 


615 


ELAAECKSAGYPGTLIPYRCDLSNEEDILSMFSAIRSCHSGVDI 
CINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTREAYQSMK 
ERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLR 
QELREAQTHIRATCISPGWETQFAFKLHDKDPEKAAATYEQMK 
CLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 


6915 


254 


652 


GRSLSFKTFLIWVLISIYQGGILMYGALVLFESEFVHWAISFT 
ALILTELLMVALTVRTWHWLMWAEFLSLGCYVSSLAFLNEYFD 
VAFITTVTFLWKVSAITWSCLPLYVLXYLRRKLSPPSYCKLAS




254 


652 


GRSLSFKTFLIWVLISIYQGGILMYGALVLFESEFVHWAISFT 
ALILTELLMVALTVRTWHWLMWAEFLSLGCYVSSLAFLNEYFD 
VAFITTVTFLWKVSAITWSCLPLYVLKYLRRKLSPPSYCKLAS


6917 


254 


652 


GRSLSFKTFLIMVLISIYQGGILMYGALVLFESEFVHWAISFT 
ALILTELLMVALTVRTWHWLMWAEFLSLGCYVSSLAFLNEYFD 
VAFITTVTFLWKVSAITWSCLPLYVLKYLRRKLSPPSYCKLAS


6918 


28 


921 


PEAGTRSWREPDPEDLRRFLLSAACRSFPQWLPGGGGGQVSSCS 
DTDVPYLLLAVKSEPGRFAERQAVRETWG3PAPGIRLLFLLGSP 
VGEAGPDLDSLVAWESRRYSDLLLWDFLDVPFNQTLKDLLLLAW 
LGRHCPTVSFVLRAQDDAFVHTPALLAHLRALPPASARSLYLGE
VFTQAMPLRKPGGPFYVPESFFEGGYPAYASGGGYVIAGRLAPW 
LLRAAARVAPFPFEDVYTGLCIRALGLVPQAHPGFLTAWPADRT 
ADHCAFRNLLLVRPLGPQASIRLWKQLQDPRLQC 





6919 


850 




QGRRELSGSVFCPFIQQEPKEMLTLSEYHERVRSQGQQLQQLQA 
ELDKLHKEVSTVRAANSERVAKLVFQRLNEDFVRKPDYALSSVG 
ASIDLQKTSHDYADRNTAYFWNRFSFKNYARPPTVILEPHVFPG 
NCWAFEGDQGQWIQLPGRVQLSDITLQHPPPSVEHTGGANSAP 
RDFAVFFLLSFFTHQGLQVYDETEVSLGKFTFDVEKSEIQTFHL 
QNDPPAAFPKVKIQILSNWGHPRFTCLYP.VRAHGVRTSEGAEGS 


6920 


1418 


591 


EAQGPSKVHLTLKKKK 


6921 


2 


1711 


MNATRSEEQFHVINHAEQTLRKMENYLKEKQLCDVLLIAGHLRI 
PAHRLVLSAVSDYFAAMFTNDVLEAKQEEVRMEGVDPNALNSLV
QYAYTGVLQLKEDTIESLLAAACLLQLTQVIDVCSNFLIKQLHP 
SNCLGIRSFGDAQGCTELLOTAHKYrMEHFIEVIKNQEFLLLPA 
NEISKLLCSDDINVPDEETIFHALMQWVGHDVQNRQGELGMLLS 
YIRLPLLPPQLLADLETSSMFTGDLECQKLLMEAMKYHLLPERR 
SMMQSPRTKPRKSTVGALYAVGGMDAMKGTTTIEKYDLRTNSWL 
HIGTMNGRRLQFGVAVIDNKLYWGGRDGLKTLNTVECFNPVGK 
IWTVMPPMSTHRHGLGVATLEGPMYAVGGHDGWSYLNTVERWDP 
EGRQWNYVASMSTPRSTVGWALNNKLYAIGGRDGSSCLKSMEY 
FDPHTNKt"_l P[ 'J _\ 3 ^ATYNGFLYWGGHDAPASNHC 
S HLSDCVERYDPKGDSWSTVAPLSVPRDAVAVCPLGDKLYVVGG 
YDGHTYLNTVESYDAQRNEWKEEVPVNIGRAGACVWVKLP 


6922


1075 


369 


LTPPAGIRHEVRDREREREREi.il- k FPU r , ELKQNIHSIT 
GLPPAMQKVMYKGLAPEDKTLREIKVTSGAKIMGGGSTINDVLA 
VNTPKDAAQQDAKAEENKKEPLCRQKQHRKVLDKGKPBDVMPSV 
KGAQERLPTVPLSGMYNKSGGKVRLTFKLEQDQLWIGTKERTEK 
LPMGSIKNWSEPIEGHEDYHMMAFQL3PTEASYYWVYWVPTQY 
VDAIKDTVLGKWQYF








LGLFCILPIDTLCAVLERDTLSIRESRLFGAWRWAEAECQRQQ 
LPVTFGNKQKVLGKALSLIRFPLMTIEEFAAGPAQSGILSDREV 
VNLFLHFTVNPKPRVEYIDRPRCCLRGKECCIIIRJQQVESRWGY 
SGTSDRIRFTVNRRISIVGFGLYGSIH3PTDYQVNIQIIEYEKK 
QTLGQNDTGFSCDGTANTFRVMFKEPIEILPNVCYTACATLXGP 
DSHYGTKGLKKWHETPAASKTVFFFF5SPGNIINGTSIEDGQIP 
EIIFYT 


6924 


2210 


1235 


EERV - ^"EYYLTAFHEGRKGALAKKPYNPI IGETFHCSWEVP 
KDRVKPKRTASRSPASCHEHPMADDPSKSYKLRFVAEQVSHHPP 
ISCFYCECEEKRLCVNTHVWTKSKFMGMSVGVSMIGEGVLRLLE 
HGEEYVFTLPSAYARSILTIPWVELGGKVSINCAKTGYSATVIF
HTKPFYGGKVHRVTAEVfCHN'PTNTIVCKAHGEWN'GTLEFTYNNG 
ETKVIDTTTLPVYPKKIRPLEKQGPMESRNLWREVTRYLRLGDI 
DAATEQKRHLEEKQRVEERKRENLRTPWKPKYFIQEGDGSGILQ 
SPLESTLMGLEVQSFPV 


6925 


2 


1S53 


RGGAAGAAMEPDSVIEDKTIELMCSVPRSLWLGCANLVESMCAL 
SCLQSMPSVRCLQISNGTSSVIVSRKRPSEGNYQKEKDLGIKYF 
DQa'SESDQVEFVEHLISRMCHYQHGHINSYLKPMLQRDFITALP 
EQGLDHIAENILSYLDARSLCAAELVCKEWQRVISEGMLWKKLI
ro PLWKGLSERRGWDQYLFKNRPTDGP pns FYRSLYPKI 
IQDIETIESNWRCGRHNLQRIQCRSENSKGVYCLQYDDEKIISG 
LRDNSIKIWDKTSLECLKVLTGHTGSVLCLQYDERVIVTGSSDS 
TVRVWDVNTGEVLNTLIHHNEAVLHLRFSNGLMVTCSKDRSIAV 
WDMASATDITLRRVLVGHRAAVNWDFDDKYIVSASGDRTIKVW 
STSTCEFVRTLNGHKRGIACLQYRDRLWSGSSDNTIRLWDIEC 
GACLRVLEGHEELVRCIRFDNIOOTSGAYDGKIKWDLQAALDP 
RAPASTLCLRTLVEHSGRVFRLQFDEFQIISSSHDDTILIWDFL 
NVPPSAQNETRSPSRTYTYISR 


6926 


1 


733 


SGRVAMDGLGLQFPEQGFPAGPPLLPPHMGGHYRDCQSLGAPPL 






WO 01/53312 



PCT/US00/34263











YAGPPEPPAGPMHPRLGPEPAGPSIPGLLAPPSALHVYYGAMGS 

DPSQPAELLGEVDRTEFEQYLHFVCKPEMGLPYQGHDSGVNLED 
SHGAISSWSDASSAVYYCNYPDV


6927






CQGFAWATDLSTDLESQLSVSCKCY3AANEILQFRDLKSQNPEH 
YVQVLKRMGNIRNEIGVFYMNQAAALQSERLVSKSVSAAEQQLW 
KKSFS CFEKGI HKtFE S I EDATNAALLLCNTGRLMRICAQAHCGA 
GDELKREFSPEEGLYYNKAIDYYLKAL^ELGTRDIHPAVWDSVN 
WELSTTYFTMATLQQDYAPLSRKAQEQIEKEVSEAMMKSLKYCD 
VDSVSARQPLCQYRAATIHHRLASMYHSCLRNQVGDEHLRKQHR
VLADLHYSKAAKLFQLLKDAPCELLRVQLERVAFAEFQMTSQNS 
NVGKLKTLSGALDIMVRTEHAFQLIQKELIEEFGQPKSGDAAAA 

IEDDTILKTNKHIYSQLLRATANKTATLLERIIIVIVHLLGQLAA 
GSAASSNAVQ 


6928 


1086 


777 


EAIDLINNLLQVKMRKRYSVDKTLSHPWLQDYQTWLDLRELECK 
ETEKKALGERVSIL 


6929 


1749 


607 


RDQRGYRDDRSPAREPGDVSARTRSGGGGGRSATTAMPPPVPN3 
NLHQHDPQDLRHNGNVWAGRPSCSRGPRRAIQKPQPAGGRRSG 
RGPAAGGLCLQPPDGGTCVPEEPPVPPMDWEALEKHLAGLQFRE 

SKSGKPSLSSRLQSGMNLQICFVNDSGSDKDSDADDSKTETSLD 
TPLSPMSKQSSSYSDRDTTEEESESLDDMDFLTRQKKLQAEAKM 
ALAMAKPMAKMQVEVEKQNRKKSPVADLLPHMPHISECLMKRSL 
KPTDLRDMTIGQLQVIVNDLHSQIESLNEELVQLLLIRDELHTE 
QDAMLVDIEDLTRHAESQQKHMAEKMPAK 


6930 


131 


545 


FKDTANVFVSLFQMRNNFRHYFIEPSQuKLFYDVITWIVTQVAI 
SYTWPFVLLSIKPSLTFYSSWYYCLHIL3ILVLLLLPVKKTQR 
RKNTHENIQLSQSKKFDEGEKSLGQNSFSTTNNVCNQNQEIASR 
HSSLKQ 


6931 




659 


FVERLPNRPACLLVASGAAEGVSAQSFLHCFTMASTAFNLQVAT 
PGGKAMEFVDVTESNARWVQDFRLKAYASPAKLESIDGARYHAL
LIPSCPGALTDLASSGSLARILQHFHSESKPICAVGHGVAALCC 
ATNEDRSWVFDSYSLTGPSVCELVKAPGFARLPLWEDFVKDSG 
ACFSASEPDAVHWLDRHLVTGQNASSTVPAVQNLLFLCGSRK 


6932 




1131 


FVDSPGQG3QAEEEEGGIQMNSRMRAHSPAEGASVESSSPGPKK 
SDMCEGCRSLAAGHPGYISHDKETSIKYVSHQHPSHPQLFSIVR 
QACVRSLSCEVCPGREGPIFFGDEQHGFVFSHTFFIKDSLARGF
QRWYS 1 1 T I MMDRI YL INS WPFLLGKVRG I IDELQGKALKVFEA 
EQFGCPQRAQRMNTAFTPFLHQRNGNAARSLTSLTSDDNLWACL 
HTSFAWLLKACGSRLTEKLLEGAPTEDTLVQMEKLADLEEESES 
W3NSEAEEEEKAPVLPESTEGRELTQGPAESSSLSGCGSWQPRK 
LPVFKSLRHMRQVGGRGTAHHELRRRANHGLCLPTRLASGPSTL 
KTLQEVTDSLLGGWLMAQGVGGII


6933 


1431 


890 


SLNLHCTLPPPPHQYPAGYPSDKEGKKPKGQSKKQPSGTTKRPI 
SDDDCPSASKVYKASDSAEAIEAFQLTPQQQHLIREDCQNQKLW 
DEVLSHLVEGPNFLKKLEQSFMCVCCQELVYQPVTTECFHNVCK 
DCLQRSFKAQVFSCPACRHDLGQNYIMIPNEILQTLLDLFFPGY 
SKGR 


6934 


3030 



2588 


DRDHSQCGGIRRVALARVSSVKLISKAKIRTVKMTFIIVLAFIV 
CWTPFFFVQMWSVWDANAPKEASAFIIVMLIASLNSCCNPWIYM 
LFTGHLFHELVQRFLCCSASYLKGRRLGE7SASKKSNSSSFVLS 
HRSSSQRSCSQPSTA 


6935 


886 


543 


NSALYVAGGNDGTSCLNSVERYSPKAGAWESVAPMNIRRSTHDL 
VAMDGWLYAVGGNDGSSSLNSIEKYNPRTNKWVAASCMFTRRSS 
VGVAVLELLNFPPPSSPTLSVSSTSL 


6936 


1347 


567 


RSHRRQFLS^ALLEFFGKSHPPPHRLFRKSLNVGLHYSHIPFLT 
TCIiHFLRKRLQKGEVGLSVETSKPQVPVGGLSRKKVPQEPKATV 
MEKRLQEAQLYKEEGNQRYREGKYRDAVSRYHRALLQLRGLDPS 
LPGPLPNLGPQGPALTPEQENILHTTQTDCYNNLAA.CLLQMEPV 
NYERVREYSQKVLERQPDNAKALYRAGVAFFHLQDYDQARHYLL 
AAVNRQPKDANVRRYLQLTCSELSSYHRKEKQLYLGMFG 


6937 


' 


727 


AVEFRCCPGRDPACFARGWRLDRVY3TCFCDQACRFTGDCCFDY 
DRACPARPCFVGEWSPWSGCADQCKPTTRVRRRSVQQEPQNGGA 
PCPPLEERAGCLEYSTPQGQDCGHTYVPAFITTSAFNKERTRQA 
TSPHWSTHTEDAGYCMEFKTESLTPHCALENRPLTRWMQYLREG 
YTVCVDCQPPAMNSVSLRCSGDGLDSDGNQTLHWQAIGNPRCQG
TWKKVRRVDQCSCPAVHSFIFI


6938 


3 


719 


NSRKLELAERVDTDFMQLKKRRUbDf EJ i ]u klAWVDH 
EGNVAAAVSSGC3LAT.KHPGRVGQAALYGCGCWAENTGAHNPYST 
AVSTSGCGEHLVRT1LARECSHALQAEDAHQALLETMQNKFISS 
PFLASEDGVLGGVIVLRSCRCSAEPDSSQKKQTLLVEFLWSHTT 
ESMCVGYMSAQDGKAKTHI5RLPPCAVAGQSVAIEGGVCRLGEP 
SELTLQAECEASQRHFRT 


6939


3 


810 


KVTAPRRPQRYSSGHGSDNSSVLSGELPPAMGRTALFHHSGGSS 
GYESLRRDSEATGSASSAPDSMSESGAASPGARTRSLKSPKKRA 
TGLQRRRLIPAPLPDTTALGRKPSLPGCWVDLPPPLAGSLKEPF 
E 1 K'J Y a 1UDVERLQRPRPTPREAPTQGLACVSTRLRLAERRQQR 
LREVQAKHKHLCEELAETQGRLMLEPGRWLEQFEVDPELEPESA 
EYLAALERATAALEQCWLCKAHVMMVTCFDISVAftSAAIPGPQ 
EVDV 


6940


1188


496 


GKMAAQPLRHRSRCATPPRGDFCGGTERAIDQASFTTSMEWDTQ 
WKGSSPLGPAGLGAEEPAAGPQLPSWLQPERCAVFQCAQCHAV 
LADSVHLAWDLSRSLGAWFSRVTNNWLEAPFLVGIEGSLKGS 
TYNLLFCGSCGIPVGFHLYSTHAALAALRGHFCLSSDKMVCYLL 
KTKAIVKASEMDIQNVPLSEKIAELKEK1VLTHNRLKSLMKILS 
EVTPDQSKPEN 


6941 


1 


713 


SLSRADSDPHGPHTCGHVliNVIIGSNVLALAEAQRQAEAliGYQA 
WLSAAMQGDVKSMAQFYGLLAHVARTRLTPSMAGASVEEDAQL 
HELAAELQIPDLQLEEALETMAWGRGPVCLLAGGEPTVQLQGSG
RGGRNQELALRVGAELRRWPLGPIDVLFLSGGTDGqDGPTEAAG 
AOTTPELASQAAAEGLDIATFLAHNDSHTFFCCLQGGAHLLHTG 


6942 


1 


246 


GDYVERYDPKTDTWTMGAPLSMPTNAVGGCLLGDRLYADGGYDG 
QTYLNTMESYDPQTNEWTQMASLNIGRAGACWVIKQP 


6943 


1 


739 


PMATGDGAKTLAIHVKALTADSIRITWKATLPASSFRLSWLRLG 
HSPAGGSITETLVQGDKTEYLLTALEPKPTYIICMVTMETTNAY 
VADETPVCAKAETADSYGPTTTLNQEQNAGPMASLPLAGIIGGA 
VALVFLFLVLGAICWYVHQAGELLTRERAYNRGSRKKDDYMESG 
TKKDNSILEIRGPGLQMLPINPYRAKEEYWHTIFPSNGSSLCK 
ATHTIGYGTTRGYRDGGIPDIDYSYT 


6944 


960


156 


VANILLNGVKYESELTGSSERAEQPLSVGRLCSTICNMPKALRT 
LCVHHFLGWLSFEGMLLFYTDFMGEWFQGDPKAPHTSEAYQKY 

FGLGTGLATLSRNLYWLSLCITYGILFSTLCTLPYSLLCDYYQ 
SKKFAGSSADGTRRGMGVDISLLSCQYFLAQILVSLVLGPLTSA
VGSAITGVMYFSSLVSFLGCLYSSLFVIYEIPPSDAADEEHRPLL 








HSKPGPARPVPLKKRGYDVTRNPHLNKGMAFTLEERLQLGIHGL 
IPPCFLSQDVQLLRIMRYYERQQSDLDKYIILMTLQDRNEKLFY 
RVLTSDVEKFMPIVYTPTVGIACQHYGLTFRRPRGLFITIHDKG 
HIATMLNSWPEDNIKAVWTDGERILGLGDLGCYGMGIPVGKLA 
LYTACGGVNPQQCLPVLLDVGTNNEELLRDPLYIGLKHQRVHGK 
AYDDLLDEFMQAVTDKFGINCLIQFEDFANANAFRLLNKYRNKY 
CMFNDDIQGTASVAVAGILAALRITKNKLSNHVFGFQGAGEAAM 
G\IAHLLVMALE\KEGVPKA\EATRKIW\MVDF\KGLIVQGRDH 
LNHEKEMFAQD\HPEVNSLEEWRLVKPTAIIGVAAIAEA\FTE 
QILRDMASFHERP\IIFALSNPTSKAECTA\EKCYRVTEGPRGF 
FAS\GSPF*GVLIWEMGKTFIPGGRGNNA*RVPRGWQLGVHSPG 
GDPGHIP\DEIFLPDSRAKLPQEVSEQHLSQGRLYP\PLST\IR 
NVFLRIAIKVFD*GYKHNLV\SYYPEPKD\KEAFCKIPGSYTPD 
YDSFYT/VDSYIWAQGKAMNVQTV


6946 


133 


2551 


sceysgitvapgdpcpgvahliiapsmasdtpeslmalctdfclr 
nldgtlgylldketlrlhpdiflpsei\cdrlvneyvelvnaac 
nf\ephe\sffnplfrdprkqpasrrihl\red\lvqd\qd\le 
airkqdl\vel \ yltn\ ceklsaks lqtlrs fshtlgvp * affg 

f\lklgrmidwvpves \llrplns laaldlsg i qtsdaa\ fltq 
wkdsl\vslvl\ynmdlsddiiir\v;vqlhklrhldisrdrlss 
yykfkltrevlslfvqklgnlmsldisg\hmilencsiskigkr 
eagqtsi\epsk\ssiipfrgfeggplqf\lgvf*gifcgrjth 
ipaykvsgdkneeqvlnaieaytehrpeitsrainllfdiarie 
rcnqllralklvitalkchkydrniqvtgsaalfyltnseyrse 
qsvklrrqviqwlngmesyqevtvqrnccltlcwfsipeeiief 
qyrrvnellls ilnptrqdes iqriavhlcnalvcqvdndhkea 
vgkmgfwtmlkltqkklldktccqtoefswxsalwmitdetpd 
ncemflnfngmklfldclnefpekqelhrnmlgllgntvaevkel 
rpqlmtsqfisvfsnlleskadgievsynacgvlshimfdgpea 
wgvcepqreeveermwaaiqswdinsrrninyrsfepilrllpq 

atarqetkemarkviehcsnfkeenmdtsr 


6947 


2 


1682 


TSVSTIPRGLASARPQSRSWRCCPVWRRSPGRARGRGLKMLKVP 
SQSFPAPRSQQRVASGGRSKVPLKQGRSLMDWIRLTKSGKDLTG 
LKGRLIEVTEEELKKHNKKDDCWICIRGFVYNVSPYMEYHPGGE 
DELMRAAGSDGTELFDQVHRWVNYESMLKECLVGRMAIKPAVLK 
DYREEEKKVLNGMLPKSQVTDTLAKEGPSYPSYDWFQTDSLVTI 

MRFRKIFLCGL/CESVGKIEIVLQKKEHTSWDFLGHPLKNHNSL 
IPRKDTGLYYRKCQLISKEDVTHDTRLFCLMLPPSTHLQVPIGQ 
HVYLKLPITGTEIVKPYTPVSGSLLSEFKEPVLPNNKYIYFLIX 
IYPTGLFTPELDRLQIGDFVSVSSPEGNFKISKFQELEDLFLLA 
AGTGFTPMVKILNYALTDIPSLRKVKLMFFNKTEDDIIWRSQL3 
KLAFKDKRLDVEFVLSAPISEWNGKQGHISPALLSEFLKRNLDK 
SKVLVCICGPVPFTEQGVRLLHDLNFSKNEIHSFTA 


6948 


104 


58 


PDGAHSFFPDEYFTCSSLCLSCGVGCKKSMNHGKEGVPHEAKSR 
CRYSHQYDNRVYTCKACYERGEEVSVVPKTSASTDSPWMGLAKY 
AWSGYVIECPNCGWYRSRQYWFGNQDPVDTWRTEIVHVWPGT 
DGFLKDNNNAAQRLLDGMNFMAQ3VSELSLGPTKAVTSHLTDQI 
APAYWRPNSQILSCNKCATSFKDNDTKHHCRACGEGFCDSCSSK 
TRPVPERGWGPAPVRVCDNCYEAR/TRPVSCYRGTSGR*RRRRT 
QETVE 


6949 


152 


4656 


GLRLCLSRPLTRPGDDSVGGSAMASGAGGVGGGGGGKIRTRRCH 
QGP1KPYQQGRQQHQGILSRVTESVKNIVPGWLQRYFNKNEDVC 








SCSTDTSEVPRWPENKEDHLVYADEESSKITDGRITPEPAVSNT 
EEPSTTSTAST\YPDVLTRVSLYRSHLNFSMLESPALHCQPSTS 
SAFPIGSSGFSLVKEIKDSTSQHDDDNISTTSGFSSRASDKDIT 
VSKNTSLPPLWSPEAERSHSLSQHTATSSKKPAFNLSAFGTLSP 
SLGNSSILKTSQLGDSPFYPGKTTYGGAAAAVRQSKLRNTPYQA 
PVRRQMKAKQLSAQSYGVTSSTARRILQSLEKMSSPLADAKRIP
SIVSSPLNSPLDRSGIDITDFQAKREKVDSQYPPVQRLMTPKPV 
SIATNRSVYFKPSLTPSGEFRKTNQRIDKKCSTGYEKNMTPGQN 
REQRESGFSYPNFSLPAANGLSSGVGGGGGKMRRSRHAFVASKP 
LEEEEMEGPVLPKISLPITSSSLPTFNFSSPEITTSSPSPINSS 
QALTNKVQMTSPSSTGSPMFKFSSPIVKSTEANV1PPSSIGFTF 
SVPVAKTAELSGSSSTLEPIISSSAHHVTTVNSTNCKKTPP3DC 
EGPFRPAEILKEGSVLDILKSPGFASPKIDSVAAQPTATSPVVY 
TRPAISSFSSSGIGFGESLKAGSSWQCDTCLLQNKVTDNKCIAC 
QAAKLSPRDTAKQTGIETPNKSGKTTLSASGTGFGDKFKPVIGT 
WDCDTCLVQNKPEAIKCVACETPKPGTCVKRALTLTWSESAET 
MTASSSSCTVTTGTLGFGDKFKRPIGSWECSVCCVSNNAEDNKC 
VSCMSEKPGSSVPTSSSSTVPVSLPSGGSLGLEKFKKPEGIWDC 

SFKFGV3SSSSGPSQTLTSTGNFKFGDQG3FKIGVSSDSGYINP 
MSEGF*FSKHIVGFKFGVSSESKPEEVKKDSKNDNFKFGLSFGL 
SNPVFLTPFQFGVaNLGQEEKKEELLKSSGAGFRFGTGVINSTR 
VPANTIVTSENKSSFNLGTIETKSVSVAPLKCQTSEAKKEEMPA 
TKGGFSFGNVEPASLPSASVFVLGRTEEKQQEPVTSTSLVFGEG 
KLTMKEPKC\QPVFSFGEFQRQTKDENSSKSTFSFSMTKPSEKE 
SEQPAKATFAFGAQTNTTADQGAAKPDLSYLNNSSSSSSTPATS 
AGGG \ 1 FGSSTS S SNPPVATFVFGQS SNPGSSS \AFGNTAES S T 
SQSLLFSQDSKLATTSSTGTAVTPFVFGPGASSNNTTTSGFGFG 
ATTTSSSAGSSFVFGTGPSAPSASPAFGAKQTPTFGQSQGASQP 
NPPGFGSISSSTALFPTGSQPAPPTFGTVSSSSQPPVFGQQPSQ 
SAFGSGTTPNSSSAFQFGSSTTNFNFTNNSPSGVFTFGANSSTP 
AASAQPSGSGGFPFNQSPAAFTVGSNGKNVFSSSGTSFSGRKIK 
TAVRRRK 


6950 


2585 


411 


PRPGSRSGLCRRAGERGAVRAGGLSRRTRAE* IMDELHYQDTDS " 
DVPEQRDSKCKVKWTHEEDEQLRALVRQFGQQDWKFLA8HFPNR 

IAKHLKGRLGKQCRERWHNHLNPEVKKSCWTEEEDRIICEAHKV 
LGNRWAEIAI<MLPGRTDNAVKNHWNSTIKRKVDTGGFLSESKDC 
KPPVYLLLELiEDKDGLQSAQPXEGQGSLLTNWPSVPPTIKEEEIJ 
S3EELAAATTSKEQEPIGTDLDAVRTPEPLEEFPKREDQEGSPP 
ETSLPYKWWEAAMLLXPAVGSSLSEALDLIESDPDAWCDLSKF 
DLPEEPSAEDSIWNSLVQLQASHQQQVLPPRQPSA\LVPSVTEY
RLDGHTISDLSRSSRGELIPISPSTEVGGSGIGTPPSVLKRQRK 
RRVALSPVTENSTSLSFLDSCNSLTPKSTPVKTLPFSPSQFLWF 
WN "LELESPSLTSTPVCSQKVWTTPLHRDKTPLHQKHAAF 
VTPDQKYSMDNTPHTPTPFKNALEKYGPLKPLPQTPHLEEDLKE 
VLRSEAGIELIIEDDIRPEKQKRKPGLRRSPIKKVRKSLALDIV 
DEDMKLMMSTLPKSLSLPTTAPSNSSSLTLSGIKEDNSLLNQGF 
LQAKPEKAAVAQKPRSHFTTPAPMSSAWKTVACGGTRDQLFMQE 
KARQLLGRLKPSHTSRTLILS 


6951 


1940 


239 


3 -Tv;-RSLQALYCQLLSFLLILALTEALAFAIQEPSPRESL 
QVLPSGTPPGTMVTAPHSSTRHTSWMLTPNPDGPPSQAAAPMA 
TPT?RAEGHPPT\TPSPPSLRQ*PPPIIjKAP/SSTGPAPAAMAT 
TSSKPEGRPRGQAAPTILLTKPPGATSRPTTAPPRTTTRRPPRP
PGSSRKGAGNSSRPVPPAPGGHSRSKEGQRGRNPSSTPLGQKRP 
LGKIFQIYKGNFTGSVEPEPSTLTPRTPLWGYSSSPQPQTVAAT 

TVPSNTSWAPTTTSLGPAKDKPGLRKAAQGGGSTFTSQGGTPDA 
TAASGAPVSP/PSCPSAFSAPPPR*PTGWPQP**LLAYCYP\CT 
SRPLSTSSGVFTAATGPTPAAFDTSVSAPSQGIPQGASTTPQAP 
THPSRVSESTISGAKEETVA\PSP*PTGCPVLSPQWYPQPQAIS 
STAWSPPGPGSLGQQGTSPMWPRGTNRSTEPPSA*ARWISPG*S 
WPSACPSPP\LCPADGVLflEEEEEDRQPGEQPEAYGNNTHHPGT 
TFQQAC\RGAAPGEIPVPLKPLRTQLSEPRSPANGDYRDTGMVP 

PESEGESGEMTDRYTIHSQLEHLQSKYIGT\ATPTPPSGSG\CE 
PTPRLVLLLHGPLRPSQLLRHCGE*EQSASPLQLDGKDASALWT 
A5RQARGELRLCLTTAVRGTSPSVSPVCQSS 
NWGKTRALASGKHVPPGKQTNPNKS/VHCDS*G**RRETTQDES
FSPHFRGKMGGW\KLEKELENTEQPVGGNEG«-EHEVTGNLNSD 
PLLELCQCPLCQLDCGSREQLIAHVYQHTAAVVSAKSYM\CPVC 
GRALSSPGSLGRHLLIHSEDQRSNCAVCGARFTSHATFNSEKLP 
BVLNMESLPTVHNEGPSSAEGKDIAFSPPWPAGILLVCNIICAA 
YRKLLEAQTPSVRKWALRRQNEPLEVRLQRLERERTAKKSRRDN 
ETPEEREVRRMRDREAKRLQRMQETDEQRARRLQRDUKAMRLKR 
AIETPEKRQARLIREREASRLKRRLEKMEMMLRAQFGQDPSAKA 
AIAAEMNFFQLPVSGVELDSQLLQKMAFEEQHSSSLH
PPPPFI IFSHPREAGT*AG~^KRSGDSECSPPVEQ*A*TRAAAQN 
*PQR*RWTEGNSPQASAVATPGQGASPAAPRCTP*PSRRHRRLP 
PGARPPAG*AAPAPTKPWLAGPASAPQPGAAPLSPPAPPLIRTR 
*CAGAAARGRPRRDRSPRPRTPGGCSWSEPRTPPAVSASAQTPS 
DAG*AGGR*GQRQRPSTGR*PPGVGGAGRSHRREGTIPGNPHPR 
AS * RAGWQR* PGP / REWGL+E PQGEEMSGPGGPGGAPFNQVGSS 
VMQAMSTGI 

PPGRRQVRAQVAGAPVGHWGTRARQVKTGGRRRARRTMPFLGQD 
WRSPGWSWIKTEDGWKRCESCSQKLERBNNHCNISHSIILNSED 
GE I FNNEEHE YAS KKRKKDHFRNDTNTQS FYREKW1 YVHKESTK 
ERHGYCTLGEAFNRLDFSSAIQDIRRFNYWKLLQLIAKSQLTS 
LSGVAQKWYFNILDKIVQEVLDDHHWPRLIKDLLQDLSSTLCIL 
/N*RSREVCISGKHQYLDLPIRNYSRLATTATGSSDD*ASE\NG 
LTLSDLPLHMLNNILYRFSDGWDIITLGQVTPTLYMLSEDRQLW 
KKLCQYHFAEKQFCRHLILSEKGHIEWKLMYFALQKHYPAKEQY 
CDTLHFCRHCSILFWKDSGHPCTAADPDSCFTPVSPQHFIDLFK 



QTSTS IFASPTSPPVLGESVLQDNSFDLNNGSDAEQEE.METQSS 
DFPPSLTQPAPDQSSTIQLHPATSPAVSPTTSPAVSLWSPAAS 
PEISPEVCPAASTWSPAVFSWSPASSAVLPAVSLEVPLTASV 
TSPKASPVTSPAAAFPTASPANKDVSSFLETTADVEEITGEGLT 
ASGSGDVMRRRIATPEEVRLPLQHGWRREVRIKKGSHRWQGETW 
YYGPCGKRMKQFPEVIKYLSRNWHSVRREHFSFSPRMPVGDFF 
E2RDTPEGLQWVQLSAEEIPSRIQAITGKRGRPRNTEKARTKEV 
PKVKRGRGRPPKVKITELLNKTDHRPLKKLEAQETLNEEDKAKI 
AKSKKKMRQKVQRGECQTTIQGQARNKRKQETKSLKQKEAKKKS 
KAEKEKGKTKQEKLKEKVKREKXEKVKMKEKEEVTKAKPACKAD 
KTLATQRRLEERQRQQMILEEMKKPTEDMCLTDHQPLPDFSRVP 
GLTLPSGAFSDCLTIVEFLHSFGKVLGFDPAKDVPSLGVLQEGL 
LCQGDSLGEVQDLLVRLLKAALHDPGFPSYCQSLKILGEKVSEI 
PLTRD>A/ - SEII,RCFLMAYGVEPALCDRLRTQPFQAQPPQQKAAV 
LAFLVHELNGSTLIINEIDKTLESMSSYRKNKWIVEGRLRRLKT 
VLAKRTGRSEVEMEGPEECLGRRRSSRIMEVTSGMEEEEEEESI 
AAVPGRRGRRDGEVDATASSIPELERQIEKLSKRQLFFRKKLLH 
SSQMLRAVSLGQDRYRRRYWVLPYLAGIFVEGTEGNLVPEEVIK 
KETDSLKVAAHASLNPALFSMKMELAGSNTTASSPARARGRPRK 



TKPGSMQPRHLKSPVRGQDSEQPQAQLQPEAQLHAPAQPQEQLQ 
LQLQSHKGFLEQEGSPLSLGQSQHDLSQSAFLSWLSQTQSHSSL 
LSSSVLTPDSSPGKLDPAPSQPPEEPEPDEAESSPDPQALWFNI 
SAQMPCNAAPTPPPAVSEDQPTPSPQQLASSKPMNRPSAANPCS 
PVQFSSTPLAGLAPKRRAGDPGEMPQSPTGLGQPKRRGRPESKF 
FKQMEQRYLTQLTAQPVPPEMCSGWWKIRDPEMLDAMLKALHPR 
GIREKALHKHLNKHRDFLQEVCLRPSADP1FEPRQLPAFQEGIM 
SWSPKEKTYETDLAVLQWVEELEQRVIMSDLQIRGWTCPSPDST 
REDIAYCEHLSDSQEDITWRGRGRE3LAPQRKTTNPLDLAVMRL 
AALEQNVERRYLREPLWPTHEWLEKALLSTPNGAPEGTTTEIS 
YEITPRIRVWRQTLEP.CRSAAQVCLCLGQLERSIAWEKSVNKVT 
CLVCRKGDNDEFLLLCDGCDRGCHIYCHP.PKMEAVPEGDWFCTV 
CLAQQVEGEFTQKPGFPKRGQKRKSGYSLNFSEGDGRRRRVLLR 
GRES PAAGPRYSEEGLSPSKRRRLSMRNHHSDLTFCE 1 1 LMEME 
SHDAAWPFLEPVNPRLVSGYRRIIKNPMDFSTMRERLLRGGYTS 
SEEFAADALLVFDNCQTFNEDDSEVGKAGHIMKRFFE\SRWEEF 
YQGKQGQSVRQGRWGVTLWHLPPTFQTKTCHFHLLMLPWVOTOV 

RYNPDF 

KLIVAMPEPTKKEENEVPAPAPPPEEPSKEKEAGTTPAKDWTLV 
ETPPGEEQAKQNANSQLSILFIEKPQGGTVKVGEDITFIAKVKA 
EDLSEKPTINGSRKWMDLASKAGKIILQLKETFERHSRVYTFEKQ 
1 1 KAKDNFAGNYRCEVTYXDKFDS CS FDIiEVHESTGTTPN I D I R 
9AFKRSGEGQEDAGELDFSGLLKRR3VKQQEBEPQVDVWELI 
TKPSEYEKIAFQYESPTCSGMLKRLKRS1RKEKKSAAFAKILDP 
VYQVDKGGRVRFWELADPKLEVKWNKNGQELRPSTKYIFEDTR 
CQSILNIDNCQMTDDSEYYVTAGDEKCSTELLVREPPIMVTKQL 
EDTTDYCGERVELECE VS EDDAQVKWFKNGEE 1 1 LVQTRYRI RV 
EGKKH I LI IEGATKADAADYS VMTTGGQSSAKLSVDLKPLKI LT 
PLTDQTVNLGKEICLKCEISENIPGKWTKNGLPVQESDRLKWH 
KGRIHKLVIDHALTEDEGDYVFAPDAYNVTLPAKVHVIDPPKII 
LDGLDADNTVTVIAGHKLRLEIPISGEPPPKAMWSRGDKAIMEG 
SGRIRTESYPDSSTLVIDIAERDDSGVYHINLKN3AGEAHASIK 
VKWDFPDPPVAPTVTEVGDDWCIMNWEPPAYDGGSPILGYFIE 
RKKKQSSRWMRLNFDLCKETTFEPKKMIEGVAYEVRIFAVNA\I 
GISKPSMPSRPFVPLAVTSPPTLLTVDSVTDTTVTMRKRPPDHI 
GAAGLDGYVLEYCFEGSTSAKQSDENGEAAYDLPAEDWIVANKD 
LIDKTKFTITGLPTDAKIFVRVKAYNAAGASEPKYYSQPILVKE 
IIEPPKIHSPKHLKQTYIRRVGDRVILVIPFQGKPRPELTWKKD 
GAEIDKNQINIRNSETDTIIFIRKAERSHSGKYDLQVKVDKFVE
TASIDIRIIDRPGPPQIVKIEDVWGRJWALTWTPPKDDGNAAIT 
GYTIQKADKKSMENLRVIEHTIEPVPHTELVIGNEYYFRVFSEN 
MCGLSEDATMTKESAVIARDGKIYKNPVYEDFDFSEAPMFTQPL 
VNRLCHSGYMATLNCSVRGNPKPKITWMKNKVAIVDDPRYRMF3 
NQC-VCTLEIRKPSPYDGGTYCCKAVNDLGTVEIECKLE VKV IAQ 
PRTSRVKTEGSQGSSAMDFSVKVDIEKEVTCPICLELLTEPLSL 
DCGH5FCQACITAKIKESVIISRGESSCPVCQTRFQPGNLRPNR 
HLANIVERVKEVKMSPQEGQKRDVCEHHGKKLQIFCKEDGKVIC 
WVCELSQEHQGHQTFRINEWKECQEKLQVALQRLIKENQEAEK 
LSDDIRQERTAWKNYIQIERQKILKGFNEMRVILDNEEQRELQK 
LIiECE\nTVLDNLAAATDQLVQQRQDAS TL I SDLQRRLRGSSVEM 
LQDVIDVMKRSESWTLKKPKSVSKKLKSVFRVPDLSGMLQVLKE 
LTDVQYYWVDVMLNPGSATSHVAISVDQRQVKTVRTCTFKNSNP 
CDFSAFGVFGCQYFSSGKYYWEVDVSGKIAWILGVHSKISSLNK 
RKSSGFAFDPSVNYSKVYSRYRPQYGYWVIGLQNTCEYNAFEDS 

SSSDFKVLTIiFMAV\LPWLGFS 

SLVHWEFGRGIEDFPYLFFQLTHCQQRICSVTQAGVQWCDHSS | 








i r Er-LNQSSHLSLLSSRDYRMLSSFHEWFWQDRFNLPPNVT 
WT3LEDRDGRVYPHPQDLLAALPLALVLLAMRLAFERPIGLPLS 
RWLGVRDQTRRQVKPNATLEKHFLTEGHRPKEPQLSLLAAQCGL 
TLQQTQRWFRRRRNQDRPQLTKKFCEASWRFLFYLSSFVGGLSV 
LYHESWLWAPVMCWDRYPNQLTLSCPAADSEA\SLYWWYI,LELG 
FYLSLLIRLPFDVKRKGGGPSSIKPRPHYDPPSTA\DFKEQVIH 
HFVAVILMTFSYSANLLRIGSLVLLLHDSSDYLLEACKMVNYMQ 
YQQVCDALFLIFSFVFFYTRLVLFPTQILYTTYYESISNRGPFF 
GYYFFNGLLMLLQLLHVFWSCLILRMLYSFMKKGQMEKDIRSDV 


6960




2068 


AKWAREKEMQEF\TRSFF\RGRPDLSTLTHSIVRRRYLAHSGRS 
HLEPEEKQALKRLVEEEPLKMQVDEAASREDKLDLTKKGKRPPT 
PCSDPERKRFRFNSESESGSEASSPDYFGPPAKNGVASRSHTEP 
KEENPRRA\SKAVEESSDEERQRDLPAQRGEESSEEEEKGYKGK 
TRKKPWKKQAPGKASVSRKQAREESEESEAEPVQRTAKKVEGN. 

3TKSLKESE L ^ r EDF.F D 
PRTRSNGRRKSAREERSCKQKSQAKRLLGDSDSEEEQKEAASSG 
DDSGRDREPPVQRKSEDRTQLKGGKRLSGSSEDEEDSGKGEPTA

. KGRTRSSi 'DJSPEAKGGKAG3GRRGEDHPAVMRLKRYIR 
ACGAHRNYKKLLGSCCSHKERLSILRAELEALGMKGTPSLGKCR 
ALKEQREEAAEVASLDVANIISGSGRPRRRTAWNPLGEAAPPGE 
LYRRTLiDSDEERPRPAPPDWSHMRGIISSDGESN 


6961 


340 


1646 


RPVJSSPT.MKPKFSI.RLRIJNLNCWGIPYLSKHRADRMRRLGDFL 
NQESFDLALLEEVWSEQDFQYLRQKLSPTYPAAHHFRSGIIGSG 
LCVFSKHPIQELTQHIYTLNGYPYMIHHGDWFSGKAVGLLVLHL 
SGMVLNAYVTHLHAEYNRQKDIYLAHRVAQAWELAQFIHHTSKK 
ADWLLCGDLNMHPEDLGCCLLKEWTGLHDAYLETRDFKGSEEG 

TTGFDPHRGTPLSDHEALMATLFVRHSPPQQNPSSTHGP\AERS 
PL/MCVCLKEALDGSLGLGMA\QARWWA\TFA\SYVIGLGL\LL 
LALLCVLAAGGGAGRAATLLWTPSVGLVLWAGAFYLFHVQEVNG 
LYRAQAELQHVLGRAREAQDLGPEPQLYALL\LGQQEGDRTKEQ 


6962


340 


1646 


RPWSSPTMKPNFSLRLRIFNLNCWGIPYLSiaiRADRMRRLGOFL 
NQEEFDLALLEEVWSEQDFQYLRQKLSPTYPAAHHFRSGIIGS3 
LCVFSKHPIQELTQHIYTLNGYPYMlHHGDWFSGKAVGLLVjHL 
SGMVLNAYVTHLHAEYNRQKDIYLAHRVAQAWELAQFIHHTSKK 
ADWLLCGDLNMHPEDLGCCLLKEWTGLHDAYLETRDFKGSEEG 
NTMVPKNCYVSQQELKPFPFGVRIDYVLYKAVSGFYISCKSFET 
TTGFDPHRGTPLSDHEALMATLFVRHSPPQQNPSSTHGP\AERS 
PL/MCVCLKEALDGSLGLGMA\QARWWA\TFA\SYVIGLGL\LIj 
IALLCVIiAAGGGAGEAAILLWTPSVGLVLWAGAFYLFHVQEVNG 
LYRAQAELQHVLGRAREAQDLGPEPQLYALL\LGQQEGDRTKEQ 


6963 


374


2618 


RVTPLILKLLKKPKTAENQKASEENEITQPGGSSAKPGLPCLWF
EAVLSPDPALIHSTHSLTNSHAHTGSSDCDISCKGMTERIHSIN 
LHNFSNSVLETLWEQRNRGHFCDVTVRIHGSMLRAQRCVLAAGS 
PFFQDKLLLGYSDIEIPSWSVQSVQKLIDFMYSGVLRVSQSEA 
LQILTAASILQIKTVIDECTRIVSQNVGDVFPGIQDSGQDTPRG 
TPESGTSGQSSDTESGYLQSHPQHSVDRIYSALYACSMQNGSGE 
RSFYSGAWSHHETALGLPRDHHMEDPSWITRIHERSQQMERYL 
STTPETTHCRKQPRPVRIQTLVGNIHIKQEMEDDYDYYGQQRVQ 
ILERNESEECTEDtDQAEGTESEPKGESFDSGVSSSIGTEPDSV 
3QQFGPGAARDSQAEPTQPEQAAEAPAEGGPQTNQLETGASSPE 
RSNEVEMDSTV1TVSNSSDKSVLQQPSVNTSIGQPLPSTQLYLR 
QTETLTSNLRMPLTLTSNTQVIGTAGNTYLPALFTTQPAGSGPK 
PFLFSLPQPLAGQQTQFVTVSQPGLSTFTAQLPAPQPLASSAGH 
STASGQGEKKPYECTLCNKTFTAKQNYVKHMFVHTGEKPHQCSI 
CWRSFSLKDYLIK\HMVTHTGVRAYQCSICNKRFTQKSSLNVHM
RLHRGEKSYECYICKKKFSHKTLLERHVALHSASNGTPPAGTPP 
GARAGPPGWACTEGTTYVCSVCPAKFDQIEQFNDHMRMHVSD3 



nvfiepriqgfmktsahpgqkhpdfsmgllfpllaalevcscgs 
sgslgynlpqnh\gllgrijtlvllgqmrrispflclkdrsdfrf 
pqekvevsqlqka\qamsflydvlqqvfnfshkall\ccmehdl 
pgptphftssaagtpgdllgagdgrrrswgqwvi3gstlalrry 
fqesistle 

iitalgvrgmpgcpcpgcgmagprllfltalalellgraggsqp 
alrsrgtatacrldnkeseswgallsgerldtwicsllgslmvg 
lsgvfpllviplemgtmlrseagawrlkqllsfalggllgnvfl 
hllpeawaytcsaspggegqslqqqqqlglwv:agiltflalek 
/hvpgqqgggdqpgpqqrphcccrraqwrplsgpagcrarprcr 
gp\dikvsgylnllantidnfthglavaasflvs:<kigllttma 
illheiphevgdfaillragfdrwsaaklqlstalggllgagfa 
ictqspkgveetaawvlpftsggflyialtovlpdlleeedpw 
gflpfkywildlsassrmetdcnpmei.ssmsgfeegsft,ngfeg 
tdmkdmrleaeawndvlfavnwnfvskslrcaddvayinvetk 
ernryclelteaglkwgyafdqvddhlctpyhetvyslldtl\ 

spayreafgkr\l lqelealkrdgqs 

rgggggrggpgarererpgepertmeaaaggrgcfqphpglqkt 
leqfhlssmsslggpaafsarwaqeaykkesakeagaaavpapv 

PAATEPPPVLHIjPAIQPPPPVLPGPFFMPSDRSTERCETVLEGE 
TISCFWGGEKRLCLPQILNSVLRDFSLQQINAVCDELHIYCSR 
CTADQLEILKVMGILPFSAPSCGLITKTDAERLCNALLYGGAYP
PPCKKELAASLALGLELSERSVRVYHE\CFGKCKGL\LVPELYS 
SPSAACIQCLD\CRLMYPPHKFWHSHKALENRTCHWGF\DSA\ 
NWRAYILLSQDYTGKEEQARLGR\CLDDVKEKFDYGNKYKRRVP 
RVS5EPPASIRPKTDDTSSQSPAPSEKDKPSSWLRTLAGSSNKS 
LGCVHPRQRLSAFRPWSPAVSASEKELSPHLPALIRDSFYSYKS 
FETAVAPNVALAPPAQQKWSSPPCAAAVSRAPEPLATCTQPRK 



PGARALPSAVPDAAAPADAPSGLEAEL 
EHLRQALEGGLDTKEAKEKFLHEWKMRVKQEEKLSAALQAKRS 
LHQELEFLRVAKKEKLREATEAKRNLRKEIERJjRAEHEKKMKEA 
NESRLRLKRELEQARQARVCDKGCEAGRLRAKYSAQIEDLQVKL 
QHAEADREQLRADLLREREAREHLEK\WK\ELQEQLKPRARPE 
AAGSEG\AAELEP 



AGTMHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFOKRQA 
GELDESVLELTSQILGANPDFATLWNCRREVLQQLETQKSPEEL
AALVKAELGFLESCLRVNPKSYGTWHHRCWLLGRLPEPNWTREL 
ELCARFLEVDERNFHCWDYRRFVATQAAVPPAEELAFTDSLITR 
NFSNYSSWHYRSCLLPQLHPQPDSGPQGRLPEDVLLKELELVQN 
AFFTDPNDQSAWFYHRWLLGRADPQDALRCLHVSRDEACLTVSF 
SRPLLVGSRMEILLLMVDDSPLIVEWRTPDGRNRPSHVWLCDLP 
AASLNDQLPQHTFRVIWTAGDVQKECVLLKGRQEGWCRDSTTDE 
QLFRCELSVEKSTVLQSELESCKELQELEPENKWCL\LTIILLM 
RALDPLLYEKETLQYFQTLK\AWDP3CRATY\LDDLRSKFLIiENS 
VLKMEYAEVRVLHLAHKDLTVLCHLEQLLLVTHLDLSHNRLRTL 
PPALAALRCLEDPPPRT\VLQASDNAIESLDGVTNLPRLQELLL 
CNNRLQQPAVLQPLASCPRLVLLNLQGNPLCQAVGILEQLAELL 
PSVSSVLT 



FPPLLSSPSAVGEGKVAVAAPCPGRSECARAKMAYIQLEPLNE 
GFLSRISGLLLCRWTCRH2CQKCYESSCCQSSSDEVEILGPFPA 
QTPPWLMASRSSDKDGDSVHTASEVPLTPRTNSPDGRRSSSDTS 
KSTYSLTRRISSLESRRPSSPLIDIKPIEFGVLSAKKEPrQPSV 
LRRTYNPDDYFRKFEPHLYSLDSNSDDVDSLTDEEILSKYQLGM 
LHFSTQYDLLHNHLTVRVIEARDLPPPISHDGSRQDMAHSNPYV 
KICLLPDQKNSKQTGVKRKTQKPVFEERYTFEIPFLEAQRRTLL 
LTWDFDKFSRHCVIGKV3VPLCEVDLVKGGHWWKALIPSSQNE 
VELGELLLSLNYLPSAGRLNVDVIRAKQLLQTDVSQGSDPFVKI 
QLVHGLKLVKTKKTSFLRGTIDPFYNESFSFKVPQEELENASLV 
FTVFGHNMKSSNDFIGRIVIG\QYSSGP\SEPNHWRRMLNTHRT 
AVEQWHSLRSRAECDRVSPASLEVT 



ACFYVPGSRSFKLIPRHGLVNMGRSGKLPSGVSAKLKRWKKGHS 
SDSNPAICRHF.QAARSRFFSRPSGRSDLTVDAVKLHNELQSGSL 
RLGKSEAPETPMEEEAELVLTEKSSGTFLSGLSD^TNVTFSKVQ 
RFWESNSAAHKEICAVLAAVTEVIRSQGGKETETEYFAALIRKA 
AQHGVCSVLKGSEFMFEKAPAHHPAAISTAKFCIQEIEKSGGSK 
EATTTLHMLTLLKDLLPCFPEGLVKSCSETLLRVMTLSHVLVTA 
CAMQAFHSLFHARPGLSTLSAEL1MAQIITALYDYVPSENDLQPL 
LAWLKVMEKAHINLVRLQWDLGLGHLPRFFGTAVTCLLSPHSQV 
LTAATQSLKEILKECVAPHMADIGSVTSSASGPAQSVAKMFRAV 
EBGLTYKFHAAWSSVLQLLCVFFEACGRQAHPVMRKCLQSLCDL 
RLSPHFPHTAALDQAVGAAVTSMGPEWLQAVPL3IDGSEETLD 
FPRSWLLPVIRDHVQETRLGFFTTYFLPLANTLKSKAMDLAQAG 
STVHSKIYD'ILUU.),' . bLPGt I'RPTDVAISFKGLARTLGMAI 
SERPDLRVTVCQALRTLITKGCQAEADRAEVSRFAKNFLPILFN
LYGQPVAAGDTPAPRRAVLETIRTYLTITDTQLVNSLLEKASEK 
VLDPASSDFTRLSVLDLWALAPCADEAAISKLYSTIRPYLESK 
AHGVQKKAYRVLEEVCASPQGPGALFVQSHLEDLKKTLLDSLRS 
TSSPAKRPRLKCLEHIVRKLSAEHKEFITALIPEVILCTKEVSV 
GARKNAFALLVEMGHAFLRFGSNQEEALQCYLVLIYPGLVGAVT 
MVSCSILALTHLLFEFKGLMGTSTVEQLLENVCLLIASRTRDW 
KSALGFIKVAVTVMDVAHLAKHVQLVM3AIGKLSDDMRRHFRMK 
LRNLFT\KFIPK\FGILTWGKKAVGPKEYHRVLVHIRKAEARAK 
RHRALSQAAVEEEEEEEEEEEPAQGKGDSIEEILADSEDEEDNE 
EEERSRGKEQRKLARQRSRAWLKEGGGDEPLNrLDPKVAQRVIiA 
TQPGPGRGRKKDHSFKVSADGRLIIREEADGNKMEEEEGAKGED
EEMADPMEDVIIRNKKHQKLKHQKEAEEEELEIPPQYQAGGSGI 
HRPVAKKAMPGAEYKAKKAKGDVKKKGRPDPYAYIPLNRSKLNR 
RKKMKLQGQFKGLVKAAQRGSQVGHKNRRKDRRP 



PGGAILLPLWRRTRPREATVPRGAAQRGRARSAEGRIPSSQSPS 
PAEAGGATRSPPPRPPRPARPPGPSAPPLLRSDAGPGATVSAAA 
AAATERARRGATMGAQLSTLGHMVLFPWFLYSLLMKLFQRST? 
AITLESPDIKYPLRLIDREIISHDTF.RFRFALPSPQHILGLPVG 
QHIYLSARIDGNLVVRPYTPISSDDDKGFVDLVIKVYFKDTHPK 
FPAGGKMSQYLESMQIGDTIEFRGPSGLLVYQGKGKFAIRPDKK 
SNPIIRTVKSVGMIAGGTGITPMLQVIRAIMKDPDDHTVCHLLF 
ANQTEKDILLRPELEELRNKHSARFKLWYTLDRAPEAWDYGQG\ 
FVNEEMIRDHLPPPE\EEPLVLMCGPPPMIQYACLPNL\DHVGH 
PTERCFVF 



LQPRCAHRGLRAQKCGRPAPGVDAMVLCPVIGKLLHKRWLASA" 
SPRRQEILSNAGLRFEWPSKFKEKLDKASFATPYGYAMETAKQ 
KALEVANRLYQKDLRAPDWIGADTIVTVGGLILEKPVDKQDAY 
RMLSRFE/SGREHSVFTGVAIVHCSSKDHQLDTRVSEFYEETKV 
KFSELSEELLWEYVHSGEPMDKAGGYGIQALGGMLVESVHGDFL 
NWGFPLNHFCKQLVKLYYPPRPEDLRRSVKHDSIPAADTFEDL








SDVEGGGSEPTQRDAGSRDEKAEAGEAGQATAEAECHRTRETLP 
PFPTRLLELIEGFMLSKGLLTACKLKVFDLLKDEAPQKAADIAS 
KVDASACGMERLLDICAAMGLLEKTEQGYSNTETANVYLASDGE 
YSLHGFIMHNNDLTWNLFTYLEFAIREGTNQHHRALGKKAEDLF 
QDAYYQSPETRLRFMRAMHGMTKLTACQVATAFNLSRFSSACDV 

IHFAAGDFFRDPLPSAELYVLCRILHDWPDDKVHKLLSRVAESC 
KPGAGLLLVETLLDEEKRVAQRALMQSLNMLVQTEGKERSLGEY 
QCLLELHGFHQVQWHLGGVLDAIL\PPKWPPEAQAACSL 


6974


3082 


2172 


RSCAAFASFASRPPLELFAPPGSHRSPPGRGVATSAQCALSVRK 
LLAARPGLGTKYQATMVYKTLFALCILTAGWRVQSLPTSAPLSV 

TSLLPKNISIESREEEITSPGSNWEGTNTDPSPSGFSSTSGGVH 
LTTTLEEHSLGTPEAGVAATLSQSAAEPPTLISPQAPASSPSSL 
STSPPEVFSASVTTNHSSTVTSTQPTGAPTAPESPTEESSSDHT 
PTSHATAEPVPQEKTPPTTVSGKVMCELIDMET\PPPFPG 


6975 




500 


RPRPTVHCCKWALKLETAMBTLINVFHAHSGKEGDKYKLSKKEL 
SLPPAPQPPPYL*1>SAVPFPIHLPLPLLPPQAQKDVDAVDKVMK 


6976


1215 


970 


GCQL*VAYGTTENSPVTFAHFPEDTVEQKAESVGRIMPHTEARI 
MNMEAGTLAKLNTPGELCIRGYCVMLGYWGEPQKTEEAVDQDKW 
YWTGDVATMMEQGFCKIVGRSKDMIIRGGENIYPAELEDPFHTH 
PKVQEVQWGVKDDRMGEE I CAC IRLKDGEETTVEE I KAFCKGK 
ISHFKIPKYIVFVTNYPLTISGKIQKFKLREQMERHLNL*IKQQ 
ACPGRLA 


6977


1298 


588 


SLFINTNLLSNQIRKTSFGMCSEPI3DNTEDQKGKLKTPDFA*R 
ANKKSKHHVWGNRTVEPFPEGTQMAVFGMGCFWGAERKFWVLKG 
VYSTOVGFAGGYTSNPTYKEVCSEKTGHAEWRWYQPEHMSFE 
ELLKVFWENHDPTQGMRQGNDHGTQYRSAIYPTSAKQMEAALSS 
KENYQKVLSEHGFGPITTDIREGQTFYYAEDYHQQYLSKNPNGY 
CGLGGTGVS CP VG I KK 


6978 


3 


242 


FAIRWGFIPLVIYLGFKRGADPGMPEPTVLSLLWG 


6979


3917 


1146 


DEARVRGEAVAAAILSRCRHWSGPPPFPFSPPDRKGLRGTEPWE 
AGPGSGATPGARAMDVRRLKVNELREELQRRGLDTRGLKTELAE 
RLQAALEAEEPDDERELDADDEPGRPGHINEEVETEGGSELEGT 
AQPPPPGLQPHAEPGGYSGPDGHYAMDNITRQNQFYDTQVIKQE 
NESGYERRPLEMEQQQAYRPEMKTEMKQGAPTSFLPPEASQLKP 
DRQQFQSRKRPYEENRGRGYFEHREDRRGRSPQPPAEEDEDDFD 
Dl'LVAiDTYNCDl/HFKVARDRSSGYPLTIEGFAYLWSGARASYG 
VRRGRVCFEMKINEEISVKHLPSTEPDPHWRIGWSLDSCSTQL 
GEEPFSYGYGGTGKKSTNSRFENYGDKFAENDVIGCFADFECGN 
DVELS FTKNGKWMGIAFR IQKEALGGQALYPHVLVKNCAVEFNF 
GQRAEPYCSVLPGFTFIQHLPLSERIRGTVGPKSKAECEILMMV 
GLPAAGKTTWAIKHAASNPSKKYNILGTNAIMDKMRVMGLRRQR 
NYAGRWDVLIQQATQCLNRLIQIAARJCKRNYILDQTNVYGSAQR 
RKMRPFEGFQRKAIVICPTDEDLKDRTIKRTDEEGKDVPDHAVL 
EMKANFTLPDVGDFLDEVLFIELQREEADKLVRQYNEEGRKAGP 
PPEKRFDNRGGGGFRGRGGGGGFQRYENRSPPGGNRGGFQNRGG 
GSGGGGNYRGGFNRSGGGGYSQNRWGNNNRDNNNSNNRGSYNRA 
PQQQPPPQQPPPPQPPPQQPPPPPSYSPARNPPGASTYNKNSNI 
PGSSANTSTPTVSSYSPPQSFGFFPSTFQPSYSQPPYNQGGYSQ 
GYTAPPPPPPPPPAYNYGSYGGYNPAPYTPPPPPTAQTYPQPSY 
NQYQQYAQQWNQYYQNQGQWPPYYGNYDYGSYSGNTQGGTSTQ 


6980 


1 


420 


GTRGRKTGRVAAPSTRRRTGNMQKLQTRSPAMSLSDPGLGYHPT 








CWTLRWPPLCSLHALHVFHCLFSSRLGTPVSPRLAMDPMCSCEA 
GGSCACAGSCKCKKCKCTSCKKSCCSCCPLGCAKCAQGCICKGA 
SEKCSCCA 


6981 


10 


1054 


PGRGFRRASLRPAFAARGVFQGGLGQAKQARTRACAALPTPHPS 
APRLLEPQGVFSLFPPPPGPWPNMILTKAQYDEIAQCLVSVPPT 
RQSLRKLKQRFPSQSQATLLSIFSQEYQKHIKRTHAKHHTSEAI 
ESYYQRYLNGWKNGAAPVLLDLANEVDYAPSLMARLILERFLQ 
EHEETPPSKSIINSMLRDPSQIPDGVLAMQVYQCIVNDCCYGPL 
VDCIKHAIGHEHEVLLRDLLLEKNLSFLDEDQLRAKGYDKTPDF 
ILQVPVAVEGHIIHWIESKASFGDECSHHAYLHDQFWSYWNRFG 
PGLVIYWYGFIQELDCNRERGILLKACFPTNIVTLCHSIA 


6982


153 




PSLLQGLQGWAQAPVDGTLGPEDSRASSPMIQNSRPSLLQPQDV 
GDTVETLMLHPVIKAFLCGSISGTCSTLLFQPLDLLKTRLQTLQ 
PSDHGSRRVGMLAVLLKWRTESLLGLWKGMSPSIVRCVPGVGI 
YFGTLYSLKQYFLRGHPPTALESVMLGVGSRSVA3VCMSPITVI 

ktryesgkygyesiyaaliRstyhseghrglfsgltatllkja^i' 
sgi ylmfynqtkn i vphdqvdatl ipitnfs cgifagi laslvt 

QPADVIKTHMQLYPLKFQWIGQAVTLIFKDYGLRGFFQGGIPRA 
LSRTLKAAMAWTV YE EMMAKMGLKS 


6983


32 


773 


EKSFLQDPSFFTMGMWSIGAGALGAAALBiLIoANTDVFLSKPQK 
AALEYLEDIDLKTLEKEPRTFKAKELWEKNGAVIMAVRRPGCFL 
CREEAADLSSLKSMLDQLGVPLYAWKEHIRTEVKDFQPYFKGE 
IFLDEKKKFYGPQRRi'UyiMFMGFIRLGVWYNFFRAWNGGFSGNLE 
GEGFILGGVFWGSGKQGILLEHREKEFGDKVNLLSVLEAAKMI 
KPQTLASEKK 


6984 


1845


1282 


GGRSAYSLPAGSLPRVPATAAAKMASGVQVADEVCRIFYDMKVR 

BPFKHFVGMLPEKDCRYALYDASFETKESRKEELMFFLWAP3LA 
PLKSKMIYASSKDAIKKKFQGIKHECQANGPEDLNRACIAEKLG 
GSLIVAFEGCPV 


6985 


1887 


1324 


RRTAGIYPCFPKPGRTRHALCSWLLLLTGQLAFDDFQESCAMM 
WQKYAGSRRSMPLGARILFHGVFYAGGFAIVYYLIQKFHSRALY 
YKLAVEQLQSIIPEAQEALGPPLNIHYLKLIDRENFVDIVDAKLK 
IPVSGSKSEGLLYVHSSRGGPFQRWHLDEVFLELKDGQQIPVFK 
LSGENGDEVKKE 


6986 


642 


1350 


YHLYFKMGDPNSRKKQALWRIjRAQLRKKKESLADQFDFKMYIAF 
VFKEKKKKSALFEVSEVIPVMTNNYEENILKGVRDSSYSLESSL 
ELLQKDWQLHAPRYQSMRRDV1GCTQEMDFILWPRNDIEKIVC 
LLFSRWKESDEPFRPVQAKFEFHHGDYEKQFLHVLSRKDKTGIV 
VNNPNQSVFLFIDRQHLQTPKNKATIFKLCSICLYLPQEQLTHW 
AVGTIEDHLRPYMPE 


6987 


1623 


341 


lieaaeicasrafkesqrqtdsknyetenwspqksqrrydmyntac 
flge1evglytiqilqltpffhkenelskkhmvqflsgkwtipp 
dprnecyialskftshlknlqsdlkrcfdffidymvllkmrytq 
keiaeimlskkvsrcfrkytelfchldpcllqskesqllqeenc 
rkklealradrfaglleylnpnykdattmesivneyafllqqns 
kkpmtnekqnsilaniilsclkpnskliqplttlkkqlrevlqf 
y-;lskqypgpyflacllfwpenqeldqdskliekyvsslnrsfr 
gqykrmcrskqastlfylgkrkglns ivhkakieqyfdkaqntw 
slwhsgdwkknevkdllrrltgqaegklisveygteekiktpv 
isvysgplrsgrniervsfylgfsiegppgl 


6988 


3 


689 


tq_lrrpavfvgsaasgirsgi,wsassghwcapaagrahapvpr 
lvrglgaastaapqdaqtgpqpmpradcimrhlpyfcrgqwrg 
fgrgskqlgiptanfpeqwdnlpadistgiyygwasvgsgdvh 
kmws igwnpyykntkksmethimhtfkedfygeilmvaivgyl 








RPEKNFDSLESLISAIQGDIEEAKKRLELPEHLKIKEDNFFQVS 
KSKIMNGH 


6989 


2 


1118 


LMPSDRPLSPSTHASAGSHCHAPPTTAREAFPIPFGSKSNMATL 
KDQLIYNLLKEEQTPQNKITVVGVGAVGMACAISILMKDLADEL 
ALVDVIEDKLKGEMMDLQHGSLFLRTPKIVSGKDYNVTANSKLV 
1 1 TAGARQQEGESRLNLVQRNVNIFKFI I PNWKYSPNCKLLI V 
SNPVDILTYVAWKISGFPKNRVIGSGCNLDSARFRYLMGERLGV 

EQWKEVHKQWESAYEVIKLKGYTSWAIGLSVADLAESIMKNLR 
RVHPVSTMIKGLYGIKDDVFLSVPCILGQNGISDLVKVTLTSEE 
EARLKKSADTLWGIQKELQF 


6990


719 


258 


THASGMASWLALRTRTAVTSLLSPTPATALAVRYASKKSGGSS 
KNLGGKSSGRRQGIKKMEGHYVHAGNIIATQRHFRWHPGAHVGV 
GKNKCLYALEEGIVRYTKEVYVPHPRNTEAVDLITRLPKGAVLY 
KTFVHWPAKPEGTFKLVAML 


6991


169 


451 


I 1 HHO^ICSTKNKRRNPKKIAYLL 

CGSHVK 


6992 


944 


510 


RQAPGCSSLALRQVRQVYCGLVRAPQVQTRPLSSRFVERRGALY 
RS PMNC ENP P P YPGPG PTAPYP P YP PQPMGPGPMGGPYPP PQGY 
PYQGYPQYGWQGGPQEPPKTTVYWEDQRRDELGPSTCLTACWT 
ALCCCCLWDMIiT 


6993 




374 


rj 'PQHNARQGPAVPPGIQAYGAAFFEDLQVDFTEMSKCRG 
DRVWIKNWNVASLCPLWKGPQTWLSPPTAVKVEGIPAWIHHSH 
VKPAARETWEARPSPDNPFRVTLKKTTSPAPVTPGS 


6994


346 


1100 


QWPEKDPVMAASSISSPWGKHVFKAILMVLVALILLHSALAQSR 

S S QVLWAI S SAI SVAFFALSG I AAQLLNALGLAGDYLAQGLKLS 
PGOVQTFLLWGAGALWYWLLSLLLGLVLALLGRILWGLKLVIF 
LAGFVALMRSVPDPSTRALLLLALLILYALLSRLTGSRASGAQL 
EAKVRGLERQVEELRWRQRRAAKGARSVEEE 


6995


144 


1346 


GSVAVGLSGIMAAQKDLWDAIVIGAGIQGCFTAYHLAKHRKRIL 
LLEQFFLPIISRGSSIIGQSRriRKAYLEDFYTRMMHECYQIWAQL 
EHEAGTQLHRQTGLLLLGMKENQELKTIQANLSRQRVEHQCLSS 
EELKQRFPNIRLPRGEVGLLDNSGGVIYAYKALRALQDAIRQLG 
GIVRDGEKWEINPGLLVTVKTTSRSYQAKSLVITAGPWTNQLL 
RPLGIEMPLQTLRIWVCYWREMVPGSYGVSQAFPCFLWLGLCPH 

SSFVRDHLPDLKPEPAVIESCMYTNTPDEQFILDRHPKYDNIVI 
GAGFSGHGFKLAPWGKILYELSMKLTPSYDLAPFRISRFPSLG 
KAHL 


6996 


543 


1942 


ETANAEAAARKSAMDWKEVLRRRLATPNTCPNKKKSEQELKDEE 
MDLFTKYYSEWKGGRKNTNEFYKTIPRFYYRLPA3NEVLLQKLR 
EESRAVFLQRKSRELLDNEELQWLWFLLDKHQTPPMIGEEAMIN 
YENFLKVGEKAGAKCKQFFTAKVFAKLLHTDSYGRISIMQFFNY 
VMRKVWLHQTRIGLSLYDVAGQGYLRE3DLENYILELIPTLPQL 
DGLEKSFYSFYVCTAVRKFFFFLDPLRTGKIKIQDILACSFLDD 
LLELRDEELSKESQETNWFSAPSALRVYGQYLNLDKDHNGMLSK 
EELSRYGTATMTNVFLDRVFQECLTYDGEMDYKTYLDFVLALEN 
RKEPAALQYIFKLLDIENKGYLNVFSLNYFFRAIQELMKIHGQD 
PVSFQDVKDEIFDMVKPKDPLKISLQDLINSNQGDTVTTILIDL 
NGFWTYENREALVANDSEUSADLDDT 


6997 


370 


1104 


AMELT I FILRLAI YILTFPLYLLNFLGLWSW I CKKWFP YFLVRF 
TVI YKEQMAS KKRELFSNLQE FAGPSGXLSLLEVGCGTGANFKF 
YPPGCRVTCIDPNPNFEKFLIKSIAENRHLQFERFWAAGENMH 








QVADGSVDWVCTLVLCSVKNQERILREVCRVLRPGGAFYFMEH 
VAAECSTWNYFWQQVLDPAWHLLFDGCNLTRESWKALERASFSK 
LKLQHIQAPLSWELVRPHIYGYAVK 


6998 


2 


G16 


FVSRALLRVRSRRHPAEERAAPGRPEDAPIECPGAT'NCPEPLWC 
SHLPVPYAPPTMESRGKSASSPKPDTKVPQVTTEAKVPPAADGK 

LKPAPTVPSSPDATPEPKGPGDGAEEDEAASGGPGGRC-PWSCEN 
FNPLLVAGGVAVAAIALILGVAFLVRKK 


6999 


14 


1591 


GRAGACSRRDTAMSIEIESSDVIRL_MQYLKENSLHRALATLQE 
ETTVSLNTVDSIESFVADINSGHWDTVLQAIQSLKLPDKTLIDL 
YEQWLELIELRELGAARSLLRQTDPMIMLKQTQPERYIHLENL 
LARSYFDPREAYPDGSSKEKRRAAIAQALAGEVS 1 /VPFSRLMAL 

IKFGQKSHVECARFSPDGQYLVTGSVDGFIEVWNFTTGKIRXDL 
KYQAQDNFMMMDDAVLCMCFSRDTEMLATGAQDGKIKVWKIQSG 
QCLRRFERAHSKGVTCLSFSKDSSQILSASFDQTIRIHGLKSGK 
TLKEFRGHSSFVNEATF'l D ^YTTSASSDGTVKIWNMKTTECS 
NTFKSLGSTAGTDITVNSV1LLPKNPEHFWCNRSMTVVIMMMQ 
GQIVRSFSSGKREGGDFVCCALSPRGEWIYCVGEDFVLYCFSTV 
TGKLERTLTVHEKDVIGIAHHPHQNLIATYSEDGLLKLWKP 


7000 


2 


827 


GPGWFLELMESEGPPESERSEFFSQREEENEEEEAQEPEETGP 
KNPLLQPALTGDVEGLQKIFEDPF.NPHHEQAMQLLLEEDIVGRN 

ETLKALVELDVDIEALNFREERARDVAARYSQTECVEFLDWADA 
RLTLKKYIAKVSIAVTDTEKGSGKLLKEDKNTILSACRAKNEWL 
ETHTEASINELFEQRQQLEDIVTPIFTKMTTPCQVKSAKSVTSH 
DQKRSQDDTSN 








FASQVQAIFILPKDSQVGPDVKSEAAPKRALYESVFGSGEICGP 
TSPKRLCIRPSEPVDAWVVSVKHDPLPIjIjPEANGHRSTNS PTI 
VSPAIVSPTQDSRPNMSRPLITRSPASPLNNQGIPTPAQLTKSN 
APVHIDVGGHMYTSSLATLTKYPESRIGRLFDGTEPIVLDSLKQ 
HYFIDRDGQMFRYILNFLRTSKLLIPDDFKDYTLLYEEAKYFQL 

SLIEEVFPEIGDVMCNSVNAGWNHDSTHVIRFPLNGYaiLNSVQ 
VLERLQQRGFE I VGS CGGGVDS S QFSE YVLRRELRRTPRVP S VI 
RIKQEPLD 


7002 


1043 


498 


PMPSSTRWTTS*TYTDTSSAWACRPTTGTCT*TAAPGFl'VHtfWP 
TPCSRHQSRRRLTCWCSTSRPCGR*GGLCVRTAP7RPTTSASS3 
SWTSAGTSWPAGRRTGTATSGTATTTSVWPGCGTRMWSTQWSSV 
PRSRSCCSRPATTPPSKPGAPHAPCASSRHLAKGLAPSSPGLPA 
RGAEVC 


7003 




61 


QGRFRA7CWQRDFLQPPGMRLSALLALASKVTLPPHYRYGMSPP 
GSVADKRKNPPWIRRRPVWEPISDEDWYLFCGDTVEILEGKDA 
GKQGKWQVIRQRNWWVGGLNTHYRYIGKTMDYRGTMIPSEAP 
LLHRQVKLVDPMDRKPTEIEWRFTEAGERVRVSTRSGRIIPKPE 
FPRADGIVPETWIDGPKDTSVEDALERTYVPCLKTLQEEVMEAM 
GIKETR\NTRRSIGIEPGAEQLLPNFCPSLEG 


7004 


121 


2285 


FLLPVLTSRSLRQPAVPHARLGGVEPAAMKSARAKTPRKPTVKK 
G\PKRTLKTQLG/YYCRVRPLGFPDQECCIEVINWTTVQLHTPE 
GYRLNRMGDYKETQYSFKQVFGTHTTQKELFDWAUPLVNDLIH 
GKNGLLFTYGVTGSGKTHTMTGSPGEGGLLPRCLDM I FNS IGSF 
QAKRYVFKSNDRNSMDIQCEVDALLERQKREAMPNPKTSSSKRQ 
VDPEFADMITVQEFCKAEEVDEDSVYGVFVSYIEIYKNYIYDLL 
EEVPFDPINPNLHNLNCFVKIKNHNMYVAGCTEVEVKSTEEAFE 
VFWRGQKKRRIANTHLNRESSRSHSVFNIKLVQAPLDADGDNVL 








QEKEQITISQLSLVDLAGSERTNRTRAEGNRLREAGNINQSLMT 
LRTCMDVLRENQMYGTNKMVP YRDS KLTHLFKNYFDGEGKVRM I 
VCWPKAEDYEENLQVMRFAEVTQEVEVARPVDKAICGLTPGRR 
YRNQPRGP \ IGNE PLVTDWLQS FPPLPSCEI LDINDEQTLPRL 
IEALEKRHNLRQMMIDEFNKQSNAFKALLQEFDNAVLSKENHMQ 
GKLNEKEKM I SGQKLE IERLEKKNKTLE YKI E I LEKTTTI YEED 
KRNLQQELETQNQKLQRQFSDKRRLEARLQGMVTETTMKWEKEC 
ERRVAAKQLEMQNKLWVKDEKLKQLKAIVTEPKTEKPERPSRER 
DREKVTQRSVSPSPVPVSYL 


7005


63 


87S 


RNMALYQRWRCLRLQGLQACRLHTAWSTPPRWLAERLGLFEEL 
WAAQVKRLASMAQKEPRTIKISLPGGQKIDAVAWNTTPYQLARQ 
ISSTLADTAVAAQVNGEPYDLERPLETDSDLRFLTFDSPSGKAV 
FWHSSTHVLGAAAEQFLGAVLCRGPSTEYGFYHDFFLGK3RTIR 
GSELPVLERICQELTAAARPFP.RLEASRDQLRQLFKDNPFKLHL 
IEEKVTGPTATVYGCGTLVDLCQGPHLRHTGQIGGLKLLSNSSS 
LWRSSG 








NA^GRKRTAVKMAAAAWLQVT.PVILt.! LCJAHPSPLSPFRASFAT 
VAAADRSKWHIPIPSGKNYFSFGKILFRNTTIFLKFDGEPCDLS 
LNITWYLKSADCYNEIYNFKAEEVELYLEKLKEKRGLSGKYQTS 
SKLFQNCSELFKTQTFSGDFMHRLPLLGEKQEAKENGTNLTFIG 
DKTAMHEPLQTWQDAPYIFIVHIGISSSKESSKENSLSNLFTMT 
VEVKGPYEYLTLEDYPLMIFFMVMCIVYVLFGVLWLAWSACYWR 
DLLR I Q FW I GAV I FLGMLEKAVF YAGFQ 


7007 




1001 


AMTVSGPGTPEPRPATPCASSVEQLRKEGNELFKCGDYGGALAA 
YTQALGLDATPQDQAVLHRNRAACHLKLEDYDKAETEASKAIEK 
DGGDVKALYRRSQALEKLGRLDQAVLDLQRCV3LEPKNKVFQEA 
LRNIGGQIQEKVRYMSSTDAKVEQMFQ'ILLDPEEKGTEKKQKAS 
QNbWLAREDAGAEKIFRSNGVQLLQRLLDMGETDLMLAALRTL 
VGICSEHQS RTVATL S I L3TRRWS ILGVES QAVS IaAACHLLQV 
MFDALKEGVKKGFRGKEGAIIVGEWKCVWGLLDVTVMEGMGLSQ 
PGQFFGDQTCSCRLFGIRFGDI ILL 


7008 


70 


1478 


CRSALGHERPPPAHLPAGGRRLQTCPRSCRWLGRPP3GLPPGPR 
SPPPLAGPGQKMVQKKPAELQGFHRSFKGQWPFELAFSLDQPDH 
GDSDFGLQCSARPDMPASQPIDIPDAKKRGKKKKRGRATDSFSG 
RFEDVYQLQEDVLGEGAHARVQTCINLIT3QEYAVKIIEKQPGH 
IRSRVFREVEMLYQCQGHRNVLELIEFFEEEDRFYLVFEKMRGG 

NILCEHPWQVSPVKICDt'DLGSGIKLNGDCSPISTPELLTPCGS 
AEYMAPEWEAFSEEASIYDKRCDLWSLGVILYILLSGYPPFVG 
RCGSDCGWDRGEACPACQNMLFES IQEGKYEFPDKDWAHI SCAA 
KDLISKLLVRDAKQRLSAAQVLQHPWQGCAPENTLPTPMVLQR 
WD5HFLLPPHPCRIHVRPGGLVRTVTVNE 


7009




626 


ARQLRNSWVDDFVAAPLIPLS00IPTGNSLYE8YYKQVDPAYTG 
RVGASEAALFLKKSGLSDIILGKIWDLADPEGKGFLDKQGFYVA 
LRLVACAQSGHEVTLSNLNLSMPPPKFHDTSSPLMVTPPSAEAH 
WAVRVEEKAKFDGIFESLLPINGLLSGDKVKPVLMNSKLPLDVL 
GRVWDLSDIDKDGHLVRDEFAVAMHLVYRALE 


7010 




571 


SHTRRAWPETLLSPLCPLLGGGTAMSGGEQKPERYYVGVDVGT 
GSVRAALVDQSGVLLAFADQPIKNWEPQFNHHEQSSEDIWAACC 
WTKKWQGIDLNQIRGLGFDATCSLWLDKQFHPLPVNQEGDS 
HRNVIMWLDHRAVSQVNRINETKHSVLQYVGG 


7011 


3 


994 


RIQTLPNQNQSQTQPLLKTPPAVLQPIAPQTTFGVQTQPQPQSL 
LQAQISAASITPLLQTQPQPLLQQPQQKAGLLQPPVRIVSQPQP 
ARRLDPPSRFSGRNDRGDQVPNRKDDRSRERERERRRSRERSPQ 
RKRSRERS PRRERERS PRRVRRWPRYTVQFS KFSLDCPS CDMM 
ELRRRYQNLYIPSDFFDAQFTWVDAFPLSRPFQLGNYCNFYVMH 



REVESLEKNMAILDPPDADHLYSAKVMLMASPSMEDLYHKSCAL 
AEDPQELRDGFQHPARLVKFLVGMKGKDEAMAIGGHWSPSLDGP 
DPEKDPSVLIKT\AIRCCKALTG 



RRAGSVKRGEARLFGPTERQSERPLRPSAARRPEMLSGKKAAAA 
AAAAAAAATGTEAGPGTAGGSENGSEVAAQPAGIaSGPAEVGPGA 
VGERTPRKKEPPRASPPGGLAEPPGSAGPQAGPTWPGSATPME 
TGIAETPEG\RRTSRRKRAKVEYREMDESLANLSEDEYYSEEER 
NAKAEKEKKLPPPPPQAPPEEENESEPEEPSGVEGAAFQSRLPH 
DRMTSQEAACFPDIISGPQQTQKVFLFIRNRTLQLWLDNPKIQL 
TFEATLQQLEAP YKS DTVLVHRVHS YLERHGL INFG I YKRIKPL 
PTKKTGKVIIIGSGVSGLAAARQLQSFGMDVTLLEARDRVGGRV 
ATFRKGNYVADLGAMWTGLGGNPMAWSKQVNMELAKI KQKCP 
LYEANGQAVPKEKDEMVEQEFNRLLEATSYLSHQLDFNV1NNKP 
VS LGQALEWI QLQEKHVKDEQ I EHWKKI VKTQEELKELLNKKV 
NLKEKIKELHQQYKEASEVKPPRDITAEFLVKSKHRDLTALCRE 
YDEl^TQGKLEEKLQELEANPPSDVYLSSRDRQILDWHFANLE 
FANATPLSTLSLKHWDQDDDFEFTGSHLTVRNGYSCVPVALAEG 
LDIKLNTAVEQVRYTASGCEVIAVNTRSTSQTFIYKCDAVLCTL 
PLGVLKQQPPAVQFVPPLPEWKTSAVQRMGFGNLNKVVLCFDRV 
FWDPSVNLFGHVGSTTASRGELFLFWNLYKAPILLALVAGEAAG 
IMENISDDVIVGRCLAILKGIFGSSAVPQPKETWSRVIRADPWA 
RGSYSYVAAGSSGNDYDLMAQPITPGPSIPGAPQPIPRLFFAGE 
HTIRNYPATVHGALLSGLREAGRIADQFLGAMYTLPRQATPGVP 
AQQST 



RRAGSVKRGEARLFGPTERQSEKPLRPSAARRPEMLSGKKAAAA 
AAAAAAAATGTEAGPGTAGGSENGSEVAAQPAGL3GPAEVGPGA 
VGERTPRKKEPPRASPPGGLAEPPGSAGPQAGPTWPGSATPME 
TGIAETPEG\RRTSRRKRAKVEYREMDESLANLSEDEYYSEEER 
NAKAEKEKKLPPPPPQAPPEEENESEPEEPSGVEGAAFQSRLPH 
DRMTSQEAACFPDIISGPQQTQKVFLFIRNRTLQLWLDNPKIQL 
TFEATbQQLEAPYNSDTVLVHRVHSYLERHGLINFGIYKRIKPL 
PTKKTGKVI I IGSGVSGLAAARQLQSFGMDVTLLEARDRVGGRV 
ATFRKGNYVADLGAMWTGLGGNPMAWSKQVNMELAKIKQKCP 
LYEANGQAVPKEKDEMVEQEFNRLLEATSYLSHQLDFNVLNNKP 
VSLGQALEWIQLQEKHVKDEQIEHWKKIVKTQEELKELLNKMV 
NLKEKIKELHQQYKEASEVKPPRDITA3FLVKSKHRDLTALCKE 
YDELAETQGKLEEKLQELEANPPSDVYLSSRDRQILDWHFANLE 
FANATPIiSTLSLKHWDQDDDFEFTGSHLTVRWGYSCVPVALAEG 
LDIKLNTAVRQVRYTASGCEVIAVWTRSTSQTFIYKCDAVLCTL 
PLGVLKQQPPAVQFVPPLPENKTSAVQRMGFGNLNKWLCFDRV 
FWDPSVNLFGHVGSTTASRGELFLFWNLYKAPILLALVAGEAAG 
IMENISDDVIVGRCLAILKGIFGSSAVPQPKETWSRWRADPWA 
RGSYSYVAAGSSGNDYDLMAQPITPGPSIPGAPQPIPRLFFAG3 
HTIRNYPATVHGALLSGLREAGRIADQFLGAMYTLPRQATPGVP 
AQQSPSM 



DFEVGDKIRILATLEDGWLEGSLKGRTGIFPYRFVKLCPDTRVE 
ETMALPQEGSLARIPETSLDCLEKTLGVEEQRHETSDHEAEEPD 
CIISEAPTSPLGHLTSEYDTDRNSYQDEDTAGGPPRSPGVEWEM 
PLATDSPTSDPTEWNGISSQPQVPFHPNLQKSQYYSTVGGSHP 
HSEQYPDLLPLEARTRDYASLPPKRMYSQLKTLQKPVLPLYRGS 
SVSASRWKPRQSSPQLHNLASYTKKHHTSSVYSISERLEMKPG 
PQAQGLVMEAATHSQGDGSTDLDSKLTQQL I EFEKS LAGPGTE P 
DKILRHFSIMDFNSEKDIVRGSSKLITEQELPERRKALRPPPPR 
PCTPVSTSPHLLVDQNLKPAPPLWRPSRPAPLPPSAQQRTNAV 
SPKLLSRHRPTCETLEKEGPGHMGRSLDQTSPCPLVLVRIEEME 
RDLDMYSRAQEELNLMLEEKQDESSRAETLEDLKFCESNIESLN ' 








MELQQLREMTLLSSQSSSLVAPSGSVSAENPEQRMLEKRAKVIE 
ELLQTERDYIRDLEMCIERIMVPMQQAQVPNIDFEGLFGNMQMV 
I KVS KQLLAALE ISDAVG PVFLGHRDELEGTYKI YCQNHDEAI A 
LLEIYEKDEKIQKHLQDSLADLKSLYNEWGCTNYINLGSFLIKP 

YKRRKDLVLKYRKGDEDS LME KI SKLNIHS 1 1 KKSNRVS SHLKH 
LTGFAPQIKDEVFEETEKNFRMQERLIKSFIRDLSLYLQHIRES 
ACVKWAAVSMWDVCMERGHRDLEQFERVHRYISDQLFTNFKER 
TERLVISPLNQLLSMFTGPHKLVQKRFDKLLDFYNCTERAEKLK 
DKKTLEELQSARNNYEALNAQLLDELPKFHQYAQGLFTNCVHGY 
AEAHCDFVHQALEQLKPLLSLLKVAGREGNLIAIFHEEHSRVLQ 
QLQVFTFFPESLPATKKPFERKTIDRQSARKPLLGLPSYMLQSE 
ELRASLLARYPPEKLFQAERNFNAAQDLDVSLLE3DLVGVIKKK 
DPMGSQNRWLIDNGVTKGFVYSSFIiKPYNPRRSHSDASVGSHSS 
TESEHGSSSPRFPRQNSGSTLTFNPN\S\MAVSFTSGSCQKQPQ 

SADVARDVKQPTATPRSYRNFRHPEIVGYSVPGRNGQSQDLVKG 
- E i: QArEDRSTEPDGSEAEGNQVYFAVYTFKARNPNELSVSA 
NQKIiKILEFKDVTGNTEWtJIAEVNGKKGYVPSNYIRKTEYT 


7015 


1842 


513 


'.QRl HE \ VAAPSWRGARLVQSVLF L 'JPrSSLL 
GFQRRCVSCVAGSAFSGPRLASASRSN3QGSALDHFLGFSQPDS 
SVTPCVPAVSMNRDEQDVLLVHHPDMPENSRVLRWLLGAPNAG 
KSTLSNQLLGRKVFPVSRKVIITTRCQALGVITEKETQVILLDTP 
GIISPGKQKRHHLELSLLEDPWKSMESADLWVLVDVSDKWTRN 
QLSPQLLRCLTKYSQIPSVLVMNKVDCLKQKSVLLELTAALTEG 
WNGKKLKMRQAFHSHPGTHCPSPAVKDPNTQSVGNPQRIGWPH 
FKEIFMLSALSQEDVKTLKQYLLTQAQPGPWEYHSAVLTSQTPE 
ElCANIIREKLLEHLPQEVPYUVQQKrAVWEEGPGGELVIQQKL 
LVPKESYVKLLIGPKGHVISQIAQEAGHDLMDIFLCDVDIRLSV 
KLLK 


7016


167 


2513 


ILNAPKPPPPRDSVEAVAAKRDTGGGSWGTGMDVSGQETDWRST 
AFRQKLVSQIEDAMRKAGVAHSKSSKDMESHVFLKAKTRDEYLS 
LVARL I IHFRDI HNKKSQASVSDPMNALQS LTGGPAAGAAG IGM 

TATPQTQLQLQQVAAAAAAATARSSSSSSRRRYSSSSSSSNSKQ 
FQAQQSAMQQ\QFQA\WQQQQQL\QQQQQQQQIILIKLHHQIJ-QQ 
QI QQQQQQLQRIAQLQLQQQQQQQQQQQQQQQQALQAQPP IQQ? 
PMQQPQPPPSQALPQQLQQMHHTQHHQPPPQPQQPPVAQNQPSQ 
LPPQSQTQPI.VSQAQALPGQMLYTQPPLKFVRAPMWQ2PPVQP 
QVQQQQTAVQTAQAAQMVAPGVQVSQSSL.JMLSSPSPGQQVQTP 
QSMPPPPQPSPQPGQPSSQPNSNVSSGPAPSPSSFLPSPSPQPF 
\Q5PVTARTPQNFSVPSPGPLNTPVNPSSVMSPAGSSQAEEQQY 
LDKLKQLSKYIEPLRRMINKIDKNEDRKKDLSKMKSLLDILTDP 
SKRCPLKTLQKCEIALEKLKNDMAVPTPPPPPVPPTKQQYLCQP 
LLDAVLANIRSPVFNHSLYRTFVPAMTAIHGPPITAPWCTRKR 
RLEDDERQSIPSVLQGEVARLDPKFLVNLDPSHCSNNGTVHLIC 
KLDDKDLPSVPPLELSVPADYPAQSPLWIDRQWQYDAWPFLQSV 
HKCMTSELLQLPDKHSVTALLNTWAQSVHQACLSAA 


7017 


1 


1785 


_.H_,GMTCYI'INSVI*ALFMATDFRRQVLSLNLNGCNSLMKKLQHL 
FAFLAHTQREAYAPRIFFEASRPPWFTPRSQQDCSEYLRFLLDR 
LHEEEKILKVQASHKPSEILECSETSLQEVASKAAVLTETPRTS 
DGEKTLIEKMFGGKLRTHlRCLNCRSTSQKAEAFTDLSLAFWPS 
YSLEYMS CPDCSQSPSI QDGGLMQASVPGPS EEPWYNPTTAAF 
ICDSLVNEKTIGSPPNEFYCSENTSVPNESNKILVNKDVPQKPG 
GETTPSVTDLLNYFLAPEILTGDNQYYCENCASLQNAEKTMQIT 
SEPEYLILTLLRFSYDQKYHVRRKILDNVSLPLVLELPVKRITS 



587 
FSSLSESWSVDVDFTDLSENLAKKLKPSGTDEASCTKLVPYLLS 
SVWHSGISSESGHYYSYARNITSTDSSYQMYHQSEALALASSQ 
SHLLGRDSPSAVFEQDLENKEMSKEWFLFNDSRVTFTSFQSVQK 
ITSRFPKDTAYVLLYKKQHSTNGLSGNNPTSGLKINGDPPLQKE 
LMDAITKDNKLYLQEQELNARARALQAASASCSFRPNGFDENDP 
PGSCGPTGGGGGGGFNTVGRLVF 
SLVFRGNTWSGEAGHHCSALFNLAAYHQLFVGTERIRAPEIIFQ 
PS L I GEEQAG I AETLQ Y ILDRYPKDVQEMLVQNVFLTGGSTTMYP 
GMKARMEKELLEMRPFRSSFQVQLASNPVLDAWYGARDWALNHL 
DDHEVWITRKEYEEKGGEY.LKEHCASNIYVPIRLPKQASRSSDA 

QASSKGSAAGGGGAGEQA 

APGGFLVTMVFPAPSPPWMLGCCSHEVTAGPPTLCKDMSALVAA 
RMRHIPLAPGSDWRDLPKIEVRLSDGTMARKLRYTHHDRKNGRS 
SSGALRGVCSCVEAGKACDPAARQFNTLIPWCLPHTGMRHNHWA 
GLYGRLEWDGFFSTTVTNPEPMGKQGRVLHPEQHRVVSV3ECAR 
SQGFPDTYRLFGNILDKHRQVGNAVPPPLAKAIGLEIKLCMLAK 

ARESASAKIKEEEAAKD 

FADSKRKSVLLDKIKNLQVALTSKQQSLETAMSFVARMTFKRVR 
NGFLMRKVAVFFSNTPTRASPQLREAVLKLSDAGITPLFLTRQE 
DRQLINALQINNTAVGHALVLPAGRDLTDFLENVLTCHVCLDIC 
NIDPSCGFGSWRPSFRDRRAAGSDVDIDMAFILDSAETTTLFQF 
NEMKKYIAYLVRQLDMSPDPKASQHFARVAWQHAPSESVDNAS 
MPPVKVhF-,11 1SK1 i LFLSRGMTQLQGTRALGSAIEYTI 
ENVFESAPNPRDLKIWLMLTGEVPECQLEEAQRVILQAKCKGY 
FFWLGIGRKVNIKEVYTFASEPNDVFFKLVDXSTELNEEPLMR 
FGRLLPSFVSSENAFYLSPDIRKQCDWFQGDQPTKNLVK7GHKQ 
VNVP1WVTSSPTSNPVTTTKPVTTTKPVTTTTKPVTTTTKPVTI 
INQPSVKPAAAKPAPAKPVAAKPVATKTATVRPPVAVKPATAAK 
PVAAKPAAVRPPAAAAAKPVATKPEVPRPQAAKPAATKPATTKP 
MVKMSREVOVFEITENSAKLHWERPEPPGPYFYDLTVTSAHDQS 
LVLKQNLTVTDRVIGGLLAGQTYHVAWCYLRSQVRATYHGSFS 
TKKSQPPPPQPARSASSSTINLMVSTEPLALTETDICKLPKDEG 
TCRDF I LKWYYDPNTKS CARFWYGGCGGKENKFGSQKECEKVCA 



VNAVSFFPNGYAFATGSDDATCRLFDLRADQEliLljYSHDNIICG 
ITSVAFSKSGRLLLAGYDDFNCNVWDTLKGDRAGVLAGHDNRVS 
CLGVTDDGMAVATGSWDS7LRIWN 
VYIGSFWSHPLLIPDMRKLFEAEEQDLFRDIQSLPRNAALRKLN 
DLIKRARLAKVHAYIISSLKKEMPSVFGKDNRKKELVNNLAEIY 
GRIEREHQISPGDFPNLKRMQDQLQAQDFSKFQPLKSKLLEWD 
DMLAHD I AQLMVLVRQE E SQRP I QMVKGGAFEGTLHG P FGHG YG 
EGAGEG I DDAE W WARDKPM YDE I F YTLS PVDGKI TGANAKKEM 
VRSKLPNSVLGKIWKLADIDKDGMLDDDEFALANHLIKVKLEGH 
ELPNELPAHLLPPSKRKVAE 
AMVFGG WP Y VPQ YRD I RRTQNADGFS TYVCLVLL VAN ILRILF 
WFGRRFESPLLWQSAIMILTMLLMLKLCTEVRVANELKARRRSF 
TAADSKDEEVKVAPRRSFLDFDPHHFWQWSSFSDYVQCVLAFTG 
VAGYITYLSIDSALFVETLGFLAVLTEAMLGVPQLYRNHRHQST 
EGMSIKMVLMWTSGDAFKTAYFLLKGAPLQFSVCGLLQVLVDLA 
ILGQAYAFARHPQKPAPHAVHPTGTKAL 
RTGVTGWAQVWMFGGGGVLSSGEQLQMPVKPERGLSPSDGWLV 
SSRRGSPGTVLGIiPFWLLTPVLVSRSIRSMLLLTRSPlAWHRLS 
QLKPPVLPGTLGGQALHLRSWLLSRQGPAETGGQGQPQGPGLRT 
RLLITGLFGAGLGGAWLALRAEKERLQQQKRTEALRQAAVGQGD 
FHLLDHRGRARCKADFRGQWVLMYFGFTHCPDICPDELEKLVQV 
VRQLEAEPGLPPVQPVFITVDPERDDVEAMARYVQDFHPRLLGL 








r 5 1 VAQASHSYRVYYNAGPKDEDQDYIVDHS IAIYLLNPDG 
LFTDYYGRSRSAEQ1 SDSVRRHMAAFRSVLS 


7025 


232 


832 


ERXSPIGNNENL*K\HSLDCLCFRGDWEGNTQFQTLQDWQEECF 
KQVIRTCEKRPTFNQHTOTNLHQRLNTGDKLNEFKELGKAFISG 
SDHTQHQLIHTSEKFCGDKECGNTFLPDSEVIQYQTVHTVKKTY 
ECKECGKSFSLRSSLTGHKRIHTGEKPFKCKDCGKAFRFHSQLS 
VHKRIHTGEKS YECKECGKAFS CG 




328 


1146 


NPNPS IGDI KDIKKAAKSMLDPAHKSHFHPVTPSLVFLCFIFDG 
LHQALLSVGVSKRSNTWGNENEER3TPYASRFKDMPNFIALEK 
SSVLRHCCDLLIGVAAGSSDKICTSSLQVQRRFKAMMASIGRLS 
HGESADLLISCNAESAIGWISSRPWVGELMFTFLFGDFESPLHK 
LRKSS*LPRKHR*QPINAVRMFLDQCMDGS1ALRAIVSEIPVFE 
E KKNNG *KGIGEIF* VWG CTL P PHYWGAVTTNVPKLSNS GKLLG 
QDEQPHIFG 


7027 


43 


954 


GRRLQQCQRPEDAEDGAEGGGKRGEAGWEGGYPEIVKENKLFEH 
YYQELKIVPEGEWGQFMDALREPLPATLRITGYKBHAKEILHCL 
KNKYFKELEDLEMDGQKVEVPQPLSMYPBBLAWHTNLSRXILRK 
SPHLEKFHQFLVSETESGNISRQEAVSMIPPLLLNVRPHHKILD 
MCAAPGSKTTQLIEMLHADMNVPFPEGFVIANDVDNKRCYLLVH 
QAKRLSSPCIMWNHDASSIPRLQIDVDGRKEILFYDRILCDVP 
CSGDGTMRKNIDVWKKWTTbNSLQLHGLQLRIATRGAEQT, 


7028 


189 


608 


SRPPP3P_FT TMVR r -i f jVPGTFST mi u 
KKMQSWYSMLSPTYKQRNEDFRKLFSKLPEAKRLIVDYSCALQR 
EILLQGRLYLSENWICFYSNIFRWETTISIQLKEVTCLKKEKTA 
KLIPNAIQ 


7029 


1343 


40 


VLESNTEAKQATGTSSKLRHBTGQEKGREGPRCP3GLAQLRLWG 
/PCPHAGRETGPRASAPIPGS*GHGWHW*RKDGRGERSEGPSAL 
SPHSPSLLNMQQAPTHVGPGMGSQRPRSSWPEQVGVGSQLSRE 
RWRA*RSLPGAAASERTEMTKERSP/RPCOGYDSSNWFTQPGKK 
TRKRNSRRNTMVSRGGGCLLYPLQSIMPE*QLR*GAHASPPTQG 
R*GKGGPRSPLTKASGTTHIPTPFFGSIP/RPTRDSGPGTDNS\ 
AAPGQKRGHREA*QGPEPV/WGRVTTHLQGPAG*TKPLGS\RNW 
VPGPAEGEQGEGAGLEGRP*PLKGCRSTLTFSPQLSIPMVGKKP 
PEGTTASFFP\RSCHSE*RKPPPSCPHAPALSLPHPLiPLPLPPL 
PLPLPGAGT*HSARSGRPGQSETGSLCHNCHHCPPHCPKCSPGG 
T 


7030 


2 


521 


FVCFSAPGSGQGGKRRVKMELSAVGERVFAAEALLKRRIRKGRM 
EYLVKWKGWSQKYSTWKPEENILDARLIAAFEEREREMELYGPK 
KRGPKPKTFLLKAQAKAKAKTYEFRSDSARGIRIPYPGRSPQDL 
ASTSRAREGLRN\RVCPRQRAAPAPAAP\PRRGPSGPGPRPG*G 
PGLHFPGPGGPSKHGFVPASEQHQHQQHLPRRGPSGPGPRPG 


7031 


960 


59 


HCSVPGAEWPRKPPAQICPQLTSRPHLSSPRSLSPGCGHSPGPG 
/ CKPS/RHCDELHEGPSRTAALPCGKPQPKHGVEECG/PCPCIiA 
PRRLTEPPALTVSPVGRAAPSGAL*PSGRACSACSHRLAPEAAL 
SAAAPRPSLGSGQIJASGLPAASLPPQDSSQPHKTVPSPARSVPP 
LGAQARAAPPRLWCPRALVSG*EASPEAVSVAAGPPVPGPTPST 
SGSTASHSRRGC*SPR*TPAPPRRDHGRSAAFEVLTAAASAQPC 
ASQGGPRP TGAGRT PSPLGLPFSRGP PAASARP FCRHPSL 


7032 


1393 


2104 


RRPGRTEPVEPPPVPPPPRASNSKSRCR*RNLHLAPL*QSPLRK 
SRQIGTSSLPFGRSAGERPRPAATFCLSRGGSSPVFL+PSSSSL 
EPWM ^QFGRLHSLFWKSWQKMNSFLLTPKLDTSLMSGWRYRQR 
LPRLHTFLKKSLQMASELAPPLPTPAPLASSLPPPPGPPPLLPV 
PLA*LSRSGILVPPNSGFSLSC\PLGDH*GSSGEVRGSCGSPPP 
HHCWVLPPPP*LLLPPR 


7033 


689 


815 


RSRDCLSSSATSNRARRSKCSGPKRATPLDSGPGP*APPGPSSA 








LMMPSSCPWRTGALGPSPAGSRALGRCTSSVGPGSRWLTRTSSP 
GCATRTWRTMRMEPRPLRSRMGESAPGIPAELPSAAPSGPSAPS 
AAAPSAPTTPAAAGPNTL*SRRTAEWCWPPSCSCCWGWC*SWSA 
WDWRRPPLQVSPAPSSSCRASCCWCLESIT*SSSTARSRATGAS 
SSSTCPTSRSDRGAAWTP\SPMGAPIjLPCSVPLISREEALQDPR 
NPSP*GVCSGSSGHAGLALGKPPVACSVP 








EDTSSMPFRLLIPLGLLCALLPQHHGAPGPDGSAPDPAHYRSRV 
KAMFYHAYDSYLENAFPFDELRPLTCD3HDTWGSFSLTLIDALD 
TLL\TLFYFQILGNVSEFQRWEVLQDSVDFDIDVNASVFETNI 
RWGGLLSAHLLSKKAGVEVEAGWPCS3PLLRMAEEAARKLLPA 
FQTPTGMPYGTVNLLHGTOPGETPVTCTAGIGTFIVEFATLSSL 
TGDPVFEDVARVALMRLWESRSDIGLVGNHIDVLTGKKVAQDAG 
IGAGVDSYFEYLVKGAILLQDKKLMAMFLEYNKAIRNYTRFDDW 
YLWVQMYKGTVSMPVFQSLEAYWPGLQSLIGDIDNAMRTFLNYY 
TVWKQFGGLPEFYNIPQGYTVEKREGYPLRPELIESAMYLYRAT 
GDPTLLELGRDAVESIEKISKVECGFATIKDLRDHKLDNRMESF 
FLAETVKYLYLLFDPTNFIHNNGSTFDAVITPYGECILGAGGYI 
FNTEAHPIDPAALHCCQRLKEEQWEVE3LMREFYSLKRSRSKFQ 
KNTVSSGPWEPPARPGTLPSPENHDQARERKPAKQKVPLIiSCPS 
QPFTSKLALLGQVFLDS S * PLDNFF I F I FLRLN YNKLLLAI I KK 
K 








EDT^SMPFRLLIPLGLLCALLPQHHG? PD i.PDPAHYRERV 
KAMFYHAYDSYLENAFPFDELRPLTCDGHDTWGSFSLTLIDALD 
TLL\TLFYFQILGNVSEFQRWEVLQDSVDFDIDVNASVFETNI 
RWGGLLSAIILLSXKAGVEVEAGWPCSGPLLRMAEEAARKLLPA 
FQTPTGMPYGTVNLLHGVNPGETPVTCTAGIGTFIVEFATLSSL 
TGDPVFEDVARVALMRLWESRSDIGLVGNHIDVLTGKWVAQDAG 

YLWVQMYKGTVSMPVFQSLEAYWPGLQSLIGDIDNAMRTFLNYY 
TVWKQFGGLPEFYNIPQGYTVEKREGYPLRPELIESAMYLYRAT 
GDPTLLELGRDAVESIEKISKVECGFATIKDLRDHKLDNRMESF 
FLAETVKYLYLLFDPTNFIHIJWGSTFDAVirPYGECrLGAGGYI 
FNTEAHPIDPAALHCCQRLKEEQWEVEDLMREFYSLKRSRSKFQ 
KNTVSSGPWEPPARPGTLFSPENHDQARERKP^KQKVPLLSCPS 
QPFTS KLALLGQVFLDSS * PLDNFF1 F I FLRLNYNKLLLAI I KK 
K 


7036 


442 


761 


CLAPLFSCFQIINLHLAPSGRLRWAViLRG?GRN*LPGEGPSIPT 
RNW*ERKAGCSQPC/PAQQHHGRPPGVSPLPRDPHPTTLRPLPP 
PPPPPPPPPRRPPRNRRPG 


7037 






RNW*ERKAGCSQPC/PAQQHHGRPPGVSPLPRDPHPTTLRPLPP 
PPPPPPPPPRRPPRNRRPG 


7038 


155 


891 


GAGAASDMS SGLRAADFPRWKRHI SEQLRRRDRLQRQAFEE I IL 
QYNKLLEKSDLHSVuAQKLQAEKHDVPNRHEISPGHDGTWNDWQ 
LQEMAQLRIKHQEELTELHKKRGELAQ\RVIDLNNQMQRKDREM 
QMNEAKIAECLQTISDLETECLDLRTKLCDLERANQTLKDEYDA 
LQITFTALEGKLRKTTEENQELVTRWMAEKAQEANRLNARE*KR 
LQEAASPAAERACRS SKGTS TSRTG 


7039 


155 


891 


GAGAASDMSSGLRAADFPRWKRHISEQLRRRDRLQRQAFEEIIL 
QYNKLLEKSDLHSVLAQKLQAEKHDVPNRHEISPGHDGTWNDNQ 
LQEMAQLRIKHQEELTELHKKRGELAQNRVIDLNNQMQRKDREM 
QMNEAKIAECLQTISDLETECLDLRTKLCDLERANQTLKDEYDA 
LQ I TFTALEGKLRKTTEENQELVTRWMAEKAQEANRLNARE *KR 
LQEAASPAAERACRSSKGTSTSRTG 


7040 






KITPPRRPHRCSSGHGSDNSSVLSGELPPAMGKTALFYHSGGSS 
GYESVMRDSEATGSASSAQDSTSENSSSVGGRCRSLKTPKKRSN 








PGSQRRRLIPALSLDTSSPVRKPPNSTGVRWVDGPLRSSPRGLG 
EPFEI KVYE IDDVERLQRRRGGASKEAMCFNAKLKI LEHRQQRI 
AEVRAKYEWLMKELEATKQYLMLDPNKWLSEFDLEQVWELDSLE 
YLEALECVTERLESRVNFCKAHLMMITCFDIT 


7041 


1 


567 


SGRVAMGRRRAPAGGSLGRALMRHQTQRSRSHRHTDSWLHTSEL 
NDGYDWGRLNLQSVTEQSSLDDFLATAELAGTEFVAEXLNIKFV 
PAEARTGLLSFEESQRIKKLHEENKQFLCIPRRPNWNQNTTPEE 
LKQAEKDNFLEWRRQL\VRLEEEQKLILTPFERKLDFWRQLWRV 
IERSDIWQIVDA 


7042




345 


P I HMAAAALRAD I\ISPLFPHI QGYLLLSASHG \ATSLHTKGAL 
PLETVTMYTVIPKSKYVLVKPDTQYPYSENLDEFKRIAENSASN 
DDLLMAEVAI SDYGDKLTLELREKY 


7043 


2 


2170 


ARGMAARDSDSEEDLVSYGTGLEPLEEGERPKKFIPLQDQTVRD 
EKGRYKRFHGAFSGGFSAGYFNTVGSKEGWTPSTFVSSRQNRAD 
KSVLGPEDFMDEEDLSS1'X3IAPKAIVTTDDFASKTKDRTREKAR 
QLAAATAP I PGATLLDDLI TPAKLS VGFELLRKMGWKEGQGVGP 
RVKRRPRRQKPDPGVKIYGCALPPGSSEGSEGEDDDYLPDNVTF 
APKDVTPVDFTPKDNVHGLAYKGLDPHQALFGTSGEHFNLFSGG 

K D 1 'LI.DEEPGDGLYGWTAPRQYKNQKESEKDLRYVGKILDGF 
SLASKELSSKKIYPPPELPRDYRPVHYFRPMVAATSENSHLLQV 
LSESAGKATPDPGTHSKHQLNASKRAELLGETPIQGSATSVLEF 

AGHCSWNMALGGGTATLKASNFKPFAKDPEKQKRYDEFUyHMKQ 
GQKDALERCLDPSMTEWERGRERDEFARAALLYASSHSTLSSRF 
THAKEEDDSDQVEVPRDQENDVGDKQSAVKMKMFGKLTRDTFEW 
HPDKLLFQ/RLVGLPRVKRDKYSVFNFLTLPETASLPTTQASSE 
KVSQHRGPDKSRKPSRWDTSKHEKKEDSISEFLRLARSKAEPPK 
QQSSPLVNKEEEHAPEIjSAN 


7044 






FPQSRKDILKDLVEMCRGVQHPLRGLFLRKYLLQCTRNILPDEG 
EPTDEETTGDISDSMDFVLLNFAEMNKLWVEMQHaGHSRDREKR 
ERERQELRILVGTNLVRLSQV 


7045 


3 


513 


LGFKMEALSRAGQEMSLAALKQHDPYITSIADLTGQVALYTFCP 
KANQWEKTDIEGTLFVYRRSASPYHGFTIVWRLWMHNLVEPV1QK 
DLEFQLHEPFLLYRNASLSIYSIWFYDKNDCHRIAKLMADWEE 
ETRRSQQA/RSGQTESQPGQWLQKPQAHKHPGDAEQSQG 


7046 


3 


513 


LGFKMEALSRAGQEMSLAALKQHDPYITSIADLTGQVALYTFCP 
KANQWEKTDIEGTLFVYRRSASPYHGFTIVNRLNMHNLVEPVNK 
DLEFQLHEPFLLYRNASLSIYSIWFYDKNDCHRIAKLMADWEE 
ETRRSQQA/RSGQTESQPGQWLQRPQAHRHPGDAEQSQG 


7047 


103 


486 


QI 1 I E -CCCWSEGLTS I KGNCHNFYTAI S KDVTYKELKNLLNSKN 
IMLIDVRE IWEILEYQKI PES INVPliDEVGEALQMtJPRDFKEKY 
NEVKPSKSDS/IVFSYLAGVRSKKALDTAISLGFHSYYER 


7048 




627 


f e clvllsswdyrhhatrrvisspvftmedsgktfsseeeeany 
wkdlamtykqraentqeelrefqegsreyeaeletqlqqietrn 
rdllsennrlrmeletikekfevqhsegyrqisaleddlaqtka 
ikdqlqkyireleqanddlerakratdhglsktfe\qrln\qai 

EKKW 


7049 


393 


938


KRTGSASYGGPPPGLGGPATXASVAGRCSSVGKIPARRCYEDEL
VPVFEAVGRIYELRLMMDFDGKNRGYAFVMYCHKHEAKRAVREL 
NNYEIRPGRLLGVCCSVDNCRLFIGGIPKMKKREEILEEIAKVT
EGVLDVIVYASAADKMKNRGLRLRGVREPPRGCHWLGRKLIAWX
ASSLWG 


7050 


393 


938 


KRTGSASYGGPPPGLGGPATXASVAGRCSSVGKIPARRCYEDEL








VPVFEAVGRIYELRLMMDFDGKNRGYAFVMYCHKHEAKRAVREL 
NNYEIRPGRLLGVCCSVDNCRLFIGGIPKMKKREEILEEIAKVT
EGVLDVIVYASAADKMKNRGLRLRGVREPPRGCHWLGRKLIAWX 
ASSLWG 


7051


119 


81S 


KKMNLAEICDNAKKGREYALLGNYDSSMVYYQGVMQQIQRHCQS 
VRDPAIKGKWQQVRQELLEEYEQVKS1VGTLESFK1DKPPDFPV 
SCQDEPFRDPAVWPPPVPAEHRAPPQIRR/RQSRSKTSEERNGR 

ASDGEMPKFDGAGYDKDLVEALERDIVSRNPSIHWDDIADLEEA 
KKLLREAGVLPMWM 


7052 


467 


715 


SCPGRGKMSKLLNPEEMT3RDYYFDSYAHFGIHEEMLKDEVRTL 
TYRNSMYHNKHVFKDKWLDVGSGTGILSMFAARQGPRR 


7053 


467 


715 


T T r r l'sh f i L i nj j 

TYRNSMYHNKHVFKDK^^^F 3SGTGTT i IPAARQGPRR 


7054 




1036 


31 L [DARRRSAGAEPTARLPWPAALEEWPSCPCEPLGEG 
RRCRWDAMEYDEKLARFRQAHLNPFNKQSGPRQHEQGPGEEVFD 
VTPEEALPELPPGEPEFRCPERVMDLGLSEDHFSRPVGLFIiASD 
VQQLRQAlEECKQVILELPEQSEKQKDAWRLIHLRJiKLQFLKD 
PNEDSPNIRVLLEHRFYKEKSKSVKQTCnKCNTIIWGLIQTWYT 
CTGCYYRCHSKCLNLISKPCVSSKVSHQAEYELNICPETGLDSQ 
DYRCAECRAPI/CS/DGWPSEARQCDYTGQYYCSHCHWNDLAV 
IPARWHNWDFEPRKVSRCSMRYLALMVSRPVLRLREIN 


7055 


• 


527 


1 b IK/KGKIILCLFIWLSMNVLT.FWKTFr.LYNQGP" ' 
EYHYLHQMLG/ALCLSRASASVLNLNCSLILLPMCRTLliAYLRG 
SQKVPSRRTRRLLDKSRTFHITCGATICIFSGVHVAAHLVNAIjN 
^ GLTGVCMEWjFL 


7056


2 


527 


dsrrvswrswlawe/wgkhlclfiwlsmnvllfwktfllynqgp 
eyhylhqmlg/alclsrasasvlnlncslillpmcrtllaylrg 
sqkvpsrrtrrlldksrtfhitcgaticifsgvhvaahlvnalv 
fsvnysedfvelnaaryrdedprkllfttvpgltgvcmewlfl 

M 


7057 


1368 




431 


giyliivrekiprptcigdrqendkenlnlenhrdqellhascqa 
sgevpsqaslrgfftedepgcfgegenlpealqniqdegtgeql 
spqerisekqlgqhlpnphsgemstmwleekretsqkgqprapm 
aqklptcrecgktfyrnsqlifhqrthtgetyfqctickkaflr 
ssdfvkhqrthtgekpckcdycgkgfsdfsglrhhekihtgekp 
ykcpiceksfiqrsnfnrkqrvhtgekpykcshcgksfswsssl 
dkhqrshlgkkpfq*pvtklsfpisisqpshkntqlhqeelclr 
gypc 


7058 




469 


fsgfgavpdalgcrmsdlriteaflymdylcfralcckgpppar 
peydlvcigltgsgktsllsklcsespdnwsttgfsikavpfq 
nailnvkelggadnirkywsryyqgsqgvifvldsasseddlea 
arn*sctqllqhpqlctlpflila 


7059 


1 


1178 


IP I "-FOPAAAAMDALLGTGPRRARGCLGAAGPTSSGRAARTPA 
APWARFSAWLECVCWTFDLELGQALELVYPNDFRLTDKEKS S 1 
C"-'T„q?PDSHSGCLGDTQFSFRMRQCGGQRSPWHADDRHYNSRAP 
VALQREPAHYFGYVYFRQVKDSSVKRGYFQKSLVLVSRLPFVRL 
FQALLSLIAPEYFDKLAPCLEAVCSEIDQWPAPAPGQTLNLPVM 
GWVQVRIPSRVDKSESSPPKQFDQENLLPAPWLASVHELDLF 
RCFRPVLTHMQTLWELMLLGEPLLVLAPSPDVSSEMVLALTSCL 
QPLRFCCDFRPYFTIHDSEFKEFTTRTQAPPNWLGVTNPFFIK 
TLQHWPHILRVGEPKMSGDLPKQVKLKKPFKV*RPWDTKP 


7060 


90 


1670 


?JL E 1 F] ^MDS TKS E PLKGS PEAEDGN I EYKKLVNPSQ 
YRFEHLVTQMKWRLQEGRGEAVYQIGVEDNGLLVGLAEEEMRAS 








LKTLHRMAEKVGADITVLREREVDYDSDMPRKITEVLVRKVPDN 

LHEIQSGRTSSISFEILGFNSKGEVHGINGTQWGQTLRMGW*** 
RT*DGGRWRLFEIV*MNALRGL*TSSAPLRKSKGNQLN*IKNG 
VKIKRQGHPGNGLGPGNSEGVGRAGRRH*GPWALGQVVNYSDSR 
TAEEICESSSKMITFIDLAGHHKYLHTTIFGLXSYCPDCALLLV 
SANTGIAGTTREHLGLALALKVPFFIWSKIDLCAKTTVERTVR 

SVSGESLDLLKVFLNILPPLTNSKE3EELMQQLTEFQVDEIYTV 
PEVGTWGGTLSR*IDLIATLPTQPSFIYSKTSWPKGGDPGI


7061 


364 


710 


ARMPSPLGPPCLPVMDPETTLEEPETARLRFRGFCYQEVAGPRE 
ALARLRELCCQWLQPEAHSKEQMLEMLVLEQFLGTLPPEIQAWV 

EEPETARLRFRGFCYQEVAGPREALARLRELCCQWLQPEAHSKE 
QMLEMLVLEQFLGTLPPEIQAWVRGQRPGSPEEAAALVEGLQHD 
PGQLLG 


7062 


71 


744 


AKAGTNLERLHWLSYFFCIPKHKLKSSQKDKVRQFMACTQAGER 

YGRYKDPQDENKIGVDGIQQFCDDLSLDPASISVLVIAWXFRAA 
TQCEFSRKEFLDGMTELGCDSMEKLKALLPRL3QELKDTAKFKD 
FYQFTPTFAKNPGQKGLDL*MAGAYWKLVLSGRFKFLYLWNTFL 
MEHH 








LTEYIHCLQPATGRWRMLLIWSVCTATGAWNWLIDPETQKVSF
FTSLWNHPFFTISCITLIGLFFAGIHKRWAPSIIAARCRTVLA 

HKGFLLRLDM 


7064 


300 


884 


RDTGSDPSSTRRLCSTCCTGH*PAE?IASPHPSR3TCPPASSAS 
SRRTGCWTCPPESGHAQARRSRRASASRKGARGAVRSAVAARC-C 
SSRAGRWLETPGRRRGPPACAAAAGRLRGPAP*AAPPTASVPAR 
CRCPAARTGAPAAATWLRRRLSGLRAPALGRRRSPGPSPKSAAP 
PLLTPLGAGRAGGSRANS 








RSQSEPKAPPPQKRSEAAFASKAHSPVAVQVPGMQNNIADPEEL 
FTKLERIGKGSFGEVFKGIDNRTQQWAIKIIDLEEAEDEIEDI
QQEITVLSQCDSSYVTKYYGSYLKGSKLWIIMEYLGGGSALDLL 
RAGPFDEFQ 


7066






YNQKHSLFNAMNRFIGAVNNMDQTVMVPSLLRDVPIADPGLDND 
VGVEVGGSGGCLEERTPP


7067 


152 


973 


KENITMATEIGSPPRFFHMPRFGHQAPROLFYKRPDFAQQQAMQ
QLTFDGKRMRICAVNRKTIDYNPSVIKYLEWRIWQRDQRDMRAIQ 
PDAGYYNDLVPPIGMLNNPMNAVTTKFVRTSTNKVKCPVFVVRW 
TPEGRRLVTGASSGEFTLWNGLTFNFETILQAHDSPVRAMTWSH 
NDMWMLTADHGGYVKYWQSNMNNVKMFQAHKEAIREARFIHNIP 
FSWPIVMVKLFSKCILGAEMHGLCQFLGNFLHPINTIFFFVFT 
HSPFCWAPF 


7068


222 


816 


D*3 M.\EYVLLIjFLALCSAKPFFSPSHIALKNMMLKDMEDTDDDDD 
DDDDDDDDDDEDNSLFPTREPRSHFFPFDLFPMCPFGCQCYSRV 
VHCSDLGLTSVPTNIPFDTRMLDLQNNKIKEIKENDFKGLTSLY 
GLILNNNKLTKIHPKAFLTTKKLRRLYLSHNQLSEIPLNLPKSL 
AELRI H2JJKVKKI QKDTFKKK 


7069


1147


1765 


FRDHRRYFYVNEQSGESQWEFPDGEEEEEESQAQENRDETLAKQ 
TLKDKTGTDSNSTESSETSTGSLCKESFSGQVSSSSLMPLTPFW 
TLLQSWVPVLQPPLPLEMPPPPPPPPESPPPPPPPPPAPKMPPP 








1RK.DKAKKSKTKMPSLVKKWQSIQRELDEEDNSSSSEED 
RVSTAQKRIEEWKQQQLVSGMAERNANFEA 


7070 


1 




DGTMEDSEAVQRATALIEQRLAQEEENEKLRGDARQKLPMDLLV 
LEDEKHHGAQSAALQKVKGQERVRKrSLDLRREIIDVGGIQNLI 
ELRKKRKQKKRDALAASHEPPPEPEEITGPVDEETFLKAAVEGK 
MKVIEKFLADGGSADTCDQFRRTALHRASLEGHMEILEKLLDNG 
ATVDFQ 


7071 


2 


921 


ARGTLRALETAKKVGKVGANGQKAAGPSADSVTENKIGSPPKTP 
VSNVAATSAGPSNVGTELIJSVPQKSSPFLTRVPAYPPHSEWIQY 
FQDPRTQIPFEVPQYPQTGYYPPPPTVPAGVAPCVPRFVRSNNV 
PESSLPPASMPYADHYSTFSPRDRMNSSPYQPPPPQPYGPVPPV 
PSGMYAPVYDSRRIWRPPMYQRDDIIRSNSLPPMDVMHSSVYQT 
SLRERYNSLDGYYSVACQPPSEPRTTVPLPREPCGHLKTSCEEQ 
IRRKPDQWAQYHTQKAPLVSSTLPVATQSPTPPSTLNRGEGS 


7072 


2 


921 


ARGTLRALETAKKVGKVGANGQKAAGPSADSVTENKIGSPPXTP
VSNVAATSAGPSNVGTELIISVPQKSSPFLTRVPAYPPHSENIQY 
FQDPRTQIPFEVPQYPQTGYYPPPPTVPAGVAPCVPRFVRSNNV
PESSLPPASMPYADHYSTFSPRDRMNSSPYQPPPPQPYGPVPPV 
PSGMYAPVYDSRRIWRPPMYQRDDIIRSNSLPPMDVMHSSVYQT 
SLRERYNSLDGYYSVACQPPSEPRTTVPLPREPCGHLKTSCEEQ 
IRRKPDQWAQYHTQKAPLVSSTLPVATQSPTPPSTLNRGEGS 


7073 


50 


504 


LAEGSFGVSDFPAPAAAPAHTLTSFSGSLSPQFRKPLGRAPAMP 
LVRYRKV1 TT F T AHQFVEGEFSEGYDPTVENTYSKI 
VTLGKDEFHLHLVDTAGQDEYSILPYSFIIGVHGYVLVYSVTSL
HSFQVIESLYQKLHEGHGK 


7074 


263 


1003 


VCPVLCSTRQEPGHSSLVTYFGKPTRRKEFLLGHCIAAGKMNIS 
VDLETNYAELVLDVGRVTLGEN^PI i " u iNEPVSRAM 
CALLNSGGGVIKAEIENEDYSYTKDGIGLDLENSFSNILLFVPE 
YLDFMQNGNYFLIFVKSWSLNTSGLRITTLSSNLYKRDITSAKV 
MNATAALEFLKDMKKTRGRLYLRPELLAKRPRVDIQEENKMKAL 
AGVFFDRTELDRKEKLTFTESTHVEI 


7075 


598 


1005 


NYINFFFRKEYPPHVQKVEINPVRLSRLQGVERIMKKTEESESQ 
VEPEIKRKVQQKRHCSTYQPTPPLSPASKKCLTHLEDLQRNCRQ
AITLNESTGPLLRTSIHQNSGGQKSQNTGLTTKKFYGNNVEKVP 
IDII 


7076 


279 


1049 


LQSB-SI Th L r F i,r") LRDSKA1 

SPLTGYVRFMNERREQL.RAKRPEVPFPEITRMLGNEWSKLPPEE 

KQRYLDEADRDKEKYMKELEQYQKTEAYKVFSRKTQDRQKGKSH 

RQDAARQATHDHEKETEVKERSVFDIPIFTEEFLNHSKAREAEL 

RQLRKSNMEFEERNAALQKHVESMRTAVEKLEVDVIQERSRNTV 

LQQHLETLRQVLTSSFASMPLPEXGETPTVDTIDSYM 


7077 




1119 


SSMGSNSEINGLALRKTDKYGFLGGSQYSGSLKSSIPVDVARQR 
ELKWLDMFSNWDKWLSRRFQKVKLRCRKGIPSSLRAKAWQYLSN 
SKELLEQNPRKFEELERAPGDPKWLDVIEKDLHRQFPFHEMFAA 
RGGHGQQDLYRILKAYTIYRPDEGYCQAQAPVAAVLLMHMPAEQ 
AFWCLVQICDKYLPGYYSAGLEAIQLDGE1FFALLRRASPLAHR 
LR RIE PVLYMTEWFMCI FARTLPWASVLRVWDMFFCE3VKI 
IFRVALVLLRHTLGSVEKLRSCQGMYETMEQLRNLPQQCMQEDF 
LVHEVTNLPVTEABIERENAAQLKKWRETRGELQYRPSRRLHGS 
RAIHEERRRQQPPLGPSSS 


7078 


483 


767 


FQGQRMAGEQKPSSNLLEQFILIAKGTSGSALTALISQVLEAPG 
VYVFGELLELANVQELAEGANAAYLQLLWLFAYGTYPDYIANKE 
SLPELY 


7079 


2 


37S 


SWEFKRPKEPSGSDGESDGPIDVGQEGQLSQMARPLSTPSSSQ 
MQARKKRRGIIEKRRRDRINSSLSELRRLVPTAFEKQGSSKLEK 








AE VLQMTVDHLKMLHATGGTGTHALLFQAS FI QQ I F 








LTHPLCHQDWSVGRLQIRALATPGHTQGHLVYLLDGEPYKGPS 
CLFSGDLLFLSGCGEFPRKREELGEEGETEVRAATVPWRALKP 


7081 


213 


506 


AVTEEEMILNSLSLCYHNKLILAPMVRVGTLPMRLLALDYGADI 
VYCEELIDLKMIQCKRWNEVLSTVDFVAPDDRWFRTCEREQN 
RWFQMGTS 


7082 


3 


1137 


APSRNTMLMAWCRGPVLLCLRQGLGTNSFLHGLGQEPFEGARSL 
CCRSSPRDLRDGEREHEAAQRKAPGAESCPSLPLSISDIGTGCL 

QAQSATEVEERHVSPSCSTSRERPFQAGELILAETGEGETKFKK 
LFRLNNFGLLNSNWGAVPFGKIVGKFPGQILRSSFGKQYMLRRP 
ALEDYWLMKRGTAITFPKDINMILSMMDINPGDTVLEAGSGSG 
GMSLFLSKAVGSQGRVISFEVRKDHHDLAKKl^YKHWRDSWKLSH 
VEEWPDNVDFIHKDISGATEDIKSLTFDAVALDMLNPHVTLPVF 
YPHLKHGGVCPVYWNITQVIELLD 


7083 


115 


541 


RSNAVQLTRMEYAMKSLSLLYPKSL3RHVSVRTSWTQQLLSEP 
SPKAPRARPCRVSTADRSVRKGIMAYSLEDLLLKVRDTLMLADK 
PFFLVLEEDGTTVETEEYt'QALAGDTVFMVLQKGQKWQPPSEQG 
TRHPLSLSHK 


7084 


3 


522 


NSVSVSSQSRFIASVPGTGVQRSAAADMAASTAAGKQRIPKVAK 
VKNKAPAEVQITAEQLLREAKERELELLPPPPQQKITDEEELND 

RALDVDYRNITl.IF AEI I 1 1 T R97MIARNIWDRAITTL 


7085 


243 


1439 


RQLARLP-F PF( LAP ITINQYLQQVYEAIDSRDGASC 
AELVSFKHPHVANPRLQMASPEEKCQQVLEPPYDEMFAAHLRCT 
YAVGNHDFIEAYKCQTV1VQSFLRAFQAHKEENWALPVMYAVAL 

AGIEDSKKWGMLFLVNQLFKIYFKINKLHLCKPLIRAIDSSNLK 
DDYSTAQRVTYKYYVGRKAMFDSDFKQAEEYLSFAFEHCHRSSQ 
KNKRMILIYLLPVKMLLGHMPTVELLKKYHLMQFAEVTRAVSEG 
NLLLLHEALAKHEAFFIRCGIFLILEKLKIITYRNLFKKVYLLL
KTHQLSLDAFLVALKFMQVEDVDIDEVQCILANLIYMGHVKGYI
EHQHQKLWSKQNPFPPLSTGC 


7086


25G 


525 


ILAARMGKQNSKLRPEVMQDLLESTDFTEHEIQEWYKGFLRDCP 
SGHLSMEEFKKIYGNFFPYGDASKFAEHVFRTFDANGDGTIDFR 
EF 


7087 


166 


723 


" ; .-.3KVAAPCVPPSNHELVPITTENAPKNWDKGEGASRGG 
NTRKSLEDNGSTRVTPSVQPHLQPIRNMSVSRTMEDSCELDLVY 
VTERIIAVSFPSTANEENFRSNLREVAQMLKSKHGGNYLLFNLS 
ERRPDITKLHAKVLEFGWPDLHTPALEKICSICKAMDTWLNAHP 
HRCRVLHNKG 


7088 


104 


759 


GTSAASPSSLLEMAGEITETGELYSSYVGLVYMFNLIVGTGALT 
MPKAFATAGWLVSLVLLVFLGFMSFMTTTFVIEAMAAANAQLHW 
KRMENLKEEEDDDSSTASDSDVLIRDNYERAEKRPILSVQRRGS 
PNPFEITDRVEMGQMASMFFNKVGWLFYFCIIVYLYGDIAIYA 
AWPFS^MQVTCSATGNDSCGVEADTKYNDTDRCWGPLRRVD 


7089 


33 


1775 


SVCWEDRYLKARMSESPLSRAPSRGGVNFLNVARTYIPNTKVEC 
HYTLPPGTMPSASDWIGIFKVEAACVRDYHTFVWSSVPESTTDG 
SPIHTSVQFQASYLPKPGAQLYQFRYVNRQGQVCGQ3PPFQFRE 
PRPMDELVTLEEADGGSDILLWPKATVLQNQLDESQQERNDLM
QLKLQLEGQVTELRSRVQELERALATARQEHTELMEQYKGISRS 
HGEITEERDILSRQQGDHVARILELEDDIQTISEKVLTKEVELD 
RLRDTVKALTREQEKLLGQLKEVQADKEQSEAELQVAQQENHHL 
NLDLKEAKSWQEEQSAQAQRLKDKVAQMKDTLGQAQQRVAELEP








LKEQLRGAQELAASSQQKATLLGEELASAAAARDRTIAKLHRSR 
LEVAEVMGKLAELGLHLK3EKCQWSKERAGLLQSVEAEKDKILK 
LSAEILRLEKAVQEERTQNQVFKTELAREKDSSLVQLSESKREfi 
TELRSALRVLQKEKEQLQEEKQELLEYMEKLEARLEKVADEKKN 
EDATTEDEEAAVGLSCPAALTDSEDESPEDMRLHPMAFVSVETQ 


7090 


33 


1775 


SVCWEDRYLKARMEESPLSRAPSRGGVNFLNVARTYIPNTKVEC 
HYTLPPGTMPSASDWIGIFKVEAACVRDYHTFVWSSVPESTTDG 
SPIHTSVQFQASYLPKPGAQLYQFRYVNRQGQVCGQSPPFQFRE
PRPMDELVTLEEADGGSDILLWPKATVLQMQLDESQQERNDLM 
QLKLQLEGQVTELRSRVQELERALATARQEHTELMEQYKGISRS
HGEITEERDILSRQQGDHVARILELEDDIQTISEKVLTKEVELD
RLRDTVKALTREQEKLLGQLKEVQADKEQSEAELQVAQQENHHL
NLDLKEAKSWQEEQSAQAQRLKDKVAQMKDTLGQAQQRVAELEP 
LKEQLRGAQELAASSQQKATLLGEELASAAAARDRTIAELHRSR 
LKV7vE\^GKLASLGLHLFCZ5fOCQl'/SXERAGLLG3ViiAJJKJJ_ [ CI7 K 
LSAEILRLEKAVQEERTQNQVFKTELAREKDSSLVQLSESKREL 

EDATTEDEEAAVGLSCPAALTDSEDESPEDMRLHPKAFVSVETQ 
ASLLLGLE 


7091 


186 


1076 


EGMLTREHRCGRSEEQELEPWPSPKKARSGRWLRNGFKRKMEEP 

ALHKQMRIVKPKVASMEEI1ATFHTDAYLQHLQKVSQEGDDDHPD 
SIEYGLGYDCPATEGIFDYAAAIGGATI1AAQCLIDGMCKVAIN 
WSGGWHHAKKDEASGFCYLNDAVLGILRLRRKFERILYVDLDLH 
HGDGVEDAFSFTSKVMTVSLHKFSPGFFPGTGDVSDVGLGKGRY 
YSVNVPIQDGIQDEKYYQICERYEPPAPNPGL 


7092 






KQGINEDQEESQKPRLGEGCEPISKRQMKKLIKQKQWEEQRELR 
KQKRKEKRKRKKLERQCQMEPNSDGHDRKRVRRDWHSTLRLII 
DCSFDXLM 


7093 


454 


655 


NFGVSGVELAQQASMVRMSFVIAACQLVLGLLMTSLTESSIQNS 
ECPQLCVCEIRPWFTPQSTYREA


7094 


2 


508 


FVRSMHWGVGFASSRPCWDLSWNQSI5FFGWWAGSEEPFSFYG 
DIIAFPLQDYGGIMAGLGSDPWWKKTLYLTGGALLAAAAYLLHE 
LLVIRKQQEIDSKDAIILHQFARPNNGVPSLSPFCLKNETYLRM 
ADLPYQNYFGGKLSAQGKMPWIEYNHEKVSGTEFII 


7095 


1 




IASSLPKMASLLQSDRVLYLVQGEKKVRAPLSQLYFCRYCSELR 
SLECVSHEVDSHYCPSCLENMPSAEAKLKKNRCANCFDCPGCMH 
TLSTRATSISTQLPDDPAKTTMKKAYYLACGFCRWTSRDVGMAD 


7096


224 


2067 


ETRSLAVQEKPSQAGRRRSSRISFAGALFLTRFLLQELLLNNFC 
SAMSPAPDAAPAPASISLFDLSADAPVFQGLSLVSHAFGEALAR 
APRTSCSGSGERESPERKLLQGPMDISEKLFCSTCDQTFQNHQE 
QREHYKLDWHRFNLKQRLKDKPLLSALDFEKQSSTGDLSSISGS 
EDSDSASEEDLQTLDRERATFEKLSRPPGFYPHRVLFQNAQGQF 
LYAYRCVLGPHQDPPEEAELLLQNLQSKGPRDCWLMAAAGHFA 
GAIFQGREWTHKTFHRYTVRAKRGTAQGLRDARGGPSHSAGAN 
LRRYNEATLYKDVRDLLAGPSWAKALEEA3TILLRAPRSGRSLF 
FGGKGAPLQRGDPRLWDIPLATRRPTFQELQRVLHKLTTLHVYE 
EDPREAVRLHSPQTHWKTVREERKKPTEEEIRKICRDEKEALGQ 
NEESPKQGSGSEGEDGFQVELELVELTVGTLDLCESEVLPKRRR 
RKRNKKEKSRDQEAGAHRTLLQQTQEEEPSTQSSQAVAAPLGPL
LDEAKAPGQPELWNALLAACRAGDVGVLKLQLAPSPADPRVLSL 
L3APLGSGGFTLLHAAAAAGRGSWRLLLEAGADPTVQCQDH 


7097 


25S 


1228 


IRTKSAATWSAWPQCGREGSRIITEPCEANAGSRQELQTERISa 
FtAAQGDQAFHSGLETNNSNSELPLRVGLKVAQGSPLMGGQVSA 








SNSFSRLHCRNANEDWMSALCPRLWDVPLHHLSIPGSHDTMTYC 
LNKKSPISHEESRLLQLLNKALPCITRPWLKWSVTQALDVTEQ 
LDAGVRYLDLRIAHMLEG3EKNLHFVHMVYTTALVEDTLTEISE 
WLERHPREWIIACRNFEGLSEDLHEYLVACIKNIFGDMLCPRG 

TEALIRYLETMKSCGR 


7098 


82 


956 


SSFLKRCRKVLGCWGIPSEQSLFSTLEEPRDKEIDNYCVMRLQT 
EARSGFWAPNRFPVNICRMTAVDGDRGGSSRETCRCHFHPSLEA 

LGSSVIYFLWNLLLLWPRVLAVALFSALFPSYVALHFLGLWLVL 
LLWVWLQGTDFMPDPSSEWLYRVTVATILYFSWFNVAEGRTRGR 
AIIHFAFLLSDSILLVATWVTHSSWLPSGIPLQLWLPVGCGCFF 


7099 


992 


210 


LFRLAPGFLRSLARQGYHQIWAFPF1PSGATATWPAASRSRSLA 
ARSLPRSPARPGPNDALLGEHDFRGQGVRAQRFRFSEEPGPGAD 
GAVLEVHVPQIGAGVSLPGILAAKCGAEVILSDS3ELPHCLEVC 

EDFEDILATIYFLMHKNPKVQLWSTYQVRSADWSLEALLYKMDM 
KCVHIPLESFDADKEDIAESTLPGRHTVEMLVISFAKDSL 








FLFSFPPQLGDPLEAFPVFKKYDRNGLNVSIECKRVSGLEPATV 
DWAFDLTKTNMQTMYEQSEWGWKDREKREEMTDDRAWYLIAWEN 
SSVPVAFSHFRFDVERGDEVLYW 


7101 


2 


503 


WRGGPRRAKRLAGGAVGWVLLVRGVHSVRAGGGRPPRAADMKKD 
VRILLVGEPRVGKTSLIMSLVSEEFPEEVPPRAEEITIPADVTP 
ERVPTHIVDYSEAEQSDEQLHQEISQANVICIVYAVNNKHSIDK 


7102 




503 


WRGGPRRAKRLAGGAVGWVLLVRGVHSVRAGGGRPPRAADMKKD 
VRILLVGEPRVGKTSLIMSLVSEEFPEEVPPRAEEITIPADVTP
ERVPTHIVDYSEAEQSDEQLHQEISQANVICIVYAVNNKHSIDK
VTSRWIPLINERTDKDSRLPLILGGIIKSDLVEYSR 








LSDKGSELKKSFDAWFDVLKVTPEEYAGQITLMDVPVFKAIQP 
DELSSCGWNKKEKYSSAP 


7104 


1670 


795 


RIiWEHRSVSAGASGWGLSSPGCLLLHPSliPEEERVDILINNAGV 
MRCPHWTTEDGFEMQFGVMHLGEAWAGAAPWVQAILPRRPPKVL 
GF*V*VKSDLF11LNPGHFLLTNLLLDKLKASAPSRIINLSSLA 
HVAGHIDFDDLNWQTRKYNTKAAYCQS\KLAIVLFTKELSRRLQ 
GSGVTVNALHPGVARTELGRHTGIHGSTFLQHHN\WAHLLAAWS 

EEVARRLWAESARLVGLEAPSVRSQPLPR 


7105 


765 


143 


GOMCRRPSPKSTSCLSMTCDLP/RGLQDPQCLALFRVAVDKHQA 
LLKAAMSGQGVDRHLFALYIVSRFLHLQSPFLTQVHSEQWQLST 
SQIPVQQMHLFDVHNYPDYVSSGGGFGPADDHGYGVSYIFMGDG 
MITFHISSKKSSTKTDSHRLGQHIEDALLDVAELFQAGQHFKRR 
FRGSGKENSRHRCGFLSRQTGASKASMTSTDF 


7106


14 


1064 


GLQAGHPHPRSASRIPEADTH\YSKLQRAFDSIWKDHEO?MFGT 
YFRVGFFGSKFGDLDEQEFVYKEPAITKLPEISHRLEAFYGQCF 
GAEFVEVIKDSTPVDKTKLDPNKAYIQITFVEPYFDEYEMKDRV
TYFEKNFHLRRFMYTTPFTLEGRPRGELHEQYRRNTVLTTMHAF 
PYIKTRISVIQKEEFVLTPIEVAIEDHKKKTLQLAVAI^QEPPD 
AKMLQMVLQGSVGATTOQGPLEVAQVFLAEIPADPKLYRHHNKL 
RLCFKEFIMRCGEAVEKNKRLITADQREYQQELKKNYNKLKENL 
RPMIERKIPELYKPIFRVBSQKRDSFHRSSFRKCETQLSQGS
591








FLGRFTGHPGAYLRLINRWRLEECHPSGCLIDLCMQMGIIMVLK 
QTWHNFMELGYPLIQNWWTRRKVRQEHGPERKISFPQWEKDYNL 
QPMNAYGLFDEYLEMILQFGFTTIFVAAFPIAPLIALLNNIIEI 
RLDAYKFVTQWRRPLASRAH3IGIWYGILEGIGILSVITNAFVI 
AITSDFIPRLVYAYKYGPCAGQGEAGQKCMVGYVNASLSVFRIS 
DFENRSEPESDGSEFSGTPLKYCP.YRDYRDPPHSLVPYGYTLQF 
WHVLAW 


7109 


964 


102 


WDQRKRNSLVPGPAHGPAQEEPWEKKESLGAAQEALSIQLQPKE 

SEXLATDTSTFEATSEGTLELQQRNPKAERLRWSPAQEESFRQM
WIHKEIPTGKKDHECSECGKTFIYNSHLVVHQRVHSGEKPYKC 
SDCGKTFKQS SWLGQHQRIHTGEKP FECNECGKA^RWGAHLVQH 
QRIHSGEKPYECNECGKAFSQSSYLSQHRRIHSGEKPFICKECG 
KAYGWCSELIRHRRVHARKEPSH 


7110


96 


.„ 


RLDNFSGFLVEVTKEERHIVPELjI l — i , ! 1ASITPVLG 
SPSTKRRQQNLQPIIEGETAHFFEEIKEEEEDGVNLSSELGDML 
KTAVQVQSSLKIJSHSDVEENQEKIALDLRLSSSRAASMPELLEQ 
LWKARA3KKKLRKTLREFEEAFYQQNGRNAQKEDRVPVLEEYRE 


7111 


2 


414 


GSGLYRGPTPGGQCIWKPNSMPPDHERNFGFTQFALELNEI.TAE 
LKRSIiPSTDTRLRPDQRYLEEGNIQAASAQKRRIEQLQRDRRKV 
MEENKIVHQARFFRRQTDSSGKEWWVTNNTYWRLRAEPGYGNMD 
GAVLW 


7112 


103 


495 


PRCFPVADRGRLIGGLPDWTIMEGKTLNLTCTVFSNPDPisVIi-J 
FKNDQDIQLSEHFSVKVEQAKYVSMTIXGVTSEDSGKYSINIKN 
KYGGEKIDVTVSVYKHGEKIPDMAPPQQAKPKLIPASASAAGQ 


7113 


1 


324 


KCLRQAWHEAPSSLAFTRWCSREERAEGGGNLHRSITRDPKPPG 
LRPSQRPMDDKKKKRSPKPCLAQPAQAPGTLRRVPVPTSHSGSL 
ALGLPHLPSPKQRAKFKRVGKEKGRPVLAGGGSGSAGTPLQHSF 
LTEVTDVYEMEGGLLNLLNDFHSGRLQAFGKECSFEQLEHVREM

DQLLSNLGSCLGALVPGGMRGGEGTYSQSHSWALGEKVGVHGEK 
SSGPLNLPRR 


7114 


3 


1492 


VWEVDEQIDHYKESQDKFLWQAAFIGKETLKDESGQECKICRKI 
IYLNTDFVSVKQRLPKYYSWERCSKHHLNFLGQNRSYVRKKDDG 
CKAYWKVCLHYNLHKAQPAERFFDPNQRGKALHQKQALRKSQRS 
QTGEKLYKCTECGKVFIQKANLWHQRTHTGEKPYECCECAKA? 
SQKSTLIAHQRTHTGEKPYECSECGKTFIQKSTLIKHQRTHTGE 

RIHTSEKPQCSEHGKASDEKPSPTKHWRTHTKENIYECSKCGKS 
FRGKSHLSVHQRIHTGEKPYECSICGKTFSGKSHLSVHHRTHTG 
EKPYECRRCGKAFGEKSTLIVHQRMHTGEKPYKCNECGKAFSEK 
SPLIKHQRIHTGERPYECTDCKKAFSRKSTLIKHQRIHTGEKPY 
KCSECGKAFSVKSTLIVHHRTHTGEKPYECRDCGKAFSGKSTLI 
KHQRSHTGDKNL 


7115 


1 


947 


NAAHG YNWGLW CM Y 1 1 PPQDWLDRGDESAP I RT PAM I GCS F WD 
REYFGDIGLLDPGMEVYGGENVKLGMRVWQCGGSMEVLPCSRVA 
HIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPM
SNPGVDFGDVSERLALRQRLKCRSFKWYLENVYPEMRVYKNTLT 
YGEVRNSKASAYCLDQGAEDGDRAILYPCHGMSSQLVRYSADGL 
LQLGPLGSTAFLPDSKCLVDDGTGRMPTLKKCEDVARPTQRLWD 
FTQSGPIVSRATGRCLEVEMSKDANFGLRLWQRCSGQKWMIRN 
WIKHARH 


7116 


866 


95 


RVRMRRNAEVIEEKLSMKSWAKFRPGEPWKGYPNIDPETDPYV? 
PGSVINNLSINTWEVDHLRDRNSGSSSSLNTTLPSTSAWSSIR 